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Diagnostics and Therapeutics for Diseases 
Associated with an IL-1 Inflammatory Haplotype 

1. Background of the Invention 

Genetics of the IL-1 Gene Cluster 

The IL-1 gene cluster is on the long arm of chromosome 2 (2ql3) and contains 
at least the genes for IL-1 a (IL-1 A), IL-1 (3 (IL-1B), and the IL-1 receptor antagonist (IL- 
1RN), within a region of 430 Kb (Nicklin, et al (1994) Genomics, 19: 382-4). The agonist 
molecules, IL-1 a and IL-1 p, have potent pro-inflammatory activity and are at the head of 
many inflammatory cascades. Their actions, often via the induction of other cytokines such as 
IL-6 and IL-8, lead to activation and recruitment of leukocytes into damaged tissue, local 
production of vasoactive agents, fever response in the brain and hepatic acute phase response. 
All three IL-1 molecules bind to type I and to type II IL-1 receptors, but only the type I 
receptor transduces a signal to the interior of the cell. In contrast, the type II receptor is shed 
from the cell membrane and acts as a decoy receptor. The receptor antagonist and the type II 
receptor, therefore, are both anti-inflammatory in their actions. 

Inappropriate production of IL-1 plays a central role in the pathology of many 
autoimmune and inflammatory diseases, including rheumatoid arthritis, inflammatory bowel 
disorder, psoriasis, and the like. In addition, there are stable inter-individual differences in 
the rates of production of IL-1, and some of this variation may be accounted for by genetic 
differences at IL-1 gene loci. Thus, the IL-1 genes are reasonable candidates for determining 
part of the genetic susceptibility to inflammatory diseases, most of which have a 
multifactorial etiology with a polygenic component. 

Certain alleles from the IL-1 gene cluster are known to be associated with 
particular disease states. For example, IL-1RN (VNTR) allele 2 has been shown to be 
associated with osteoporosis (U.S. Patent No. 5,698,399), nephropathy in diabetes mellitus 
(Blakemore, et al. (1996) Hum. Genet. 97(3): 369-74), alopecia areata (Cork, et al., (1995) J. 
Invest. Dermatol. 104(5 Supp.): 15S-16S;Cork etal. (1996) Dermatol Clin 14:671-8), 
Graves disease (Blakemore, et al. (1995) J. Clin. Endocrinol. 80(1): 1 1 1-5), systemic lupus 
erythematosus (Blakemore, et al. (1994) Arthritis Rheum. 37: 1380-85), lichen sclerosis 
(Clay, et al. (1994) Hum. Genet. 94: 407-10), and ulcerative colitis (Mansfield, et al. (1994) 
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Gastocntcrol. 106(3): 637-42)). 

In addition, the IL-1 A allele 2 from marker -889 and IL-1B (TaqI) allele 2 
from marker +3954 have been found to be associated with periodontal disease (U.S. Patent 
No. 5,686,246; Kornman and diGiovine (1998) Ann Periodont 3: 327-38; Hart and Kornman 
(1997)Periodontol 2000 14: 202-15; Newman (1997) Compend Contin Educ Dent 18: 881- 
4; Kornman et al. (1997) J. Clin Periodontol 24: 72-77). The IL-1 A allele 2 from marker - 
889 has also been found to be associated with juvenile chronic arthritis, particularly chronic 
iridocyclitis (McDowell, et al. (1995) Arthritis Rheum. 38: 221-28 ). The IL-1B (TaqI) allele 
2 from marker +3954 of IL-1 B has also been found to be associated with psoriasis and insulin 
dependent diabetes in DR3/4 patients (di Giovine, et al (1995) Cytokine 7: 606; Pociot, et al. 
(1992) Eur J. Clin. Invest. 22: 396-402). Additionally, the IL-IRN (VNTR) allele 1 has been 
found to be associated with diabetic retinopathy (see USSN 09/037472, and 
PCT/GB97/02790). Furthermore allele 2 of IL-IRN (VNTR) has been found to be associated 
with ulcerative colitis in Caucasian populations from North America and Europe (Mansfield, 
J. et al., (1994) Gastroenterology 106: 637-42). Interestingly, this association is particularly 
strong within populations of ethnically related Ashkenazi Jews (PCT W097/25445). 

Genotype Screening 

Traditional methods for the screening of heritable diseases have depended on 
either the identification of abnormal gene products (e.g., sickle cell anemia) or an abnormal 
phenotype (e.g., mental retardation). These methods are of limited utility for heritable 
diseases with late onset and no easily identifiable phenotypes such as, for example, vascular 
disease. With the development of simple and inexpensive genetic screening methodology, it 
is now possible to identify polymorphisms that indicate a propensity to develop disease, even 
when the disease is of polygenic origin. The number of diseases that can be screened by 
molecular biological methods continues to grow with increased understanding of the genetic 
basis of multifactorial disorders. 

Genetic screening (also called genotyping or molecular screening), can be 
broadly defined as testing to determine if a patient has mutations (alleles or polymorphisms) 
that either cause a disease state or are "linked" to the mutation causing a disease state. 
Linkage refers to the phenomenon th DNA sequences which are close together in the genome 
have a tendency to be inherited together. Two sequences may be linked because of some 
selective advantage of co-inheritance. More typically, however, two polymorphic sequences 
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are co-inherited because of the relative infrequency with which meiotic recombination events 
occur within the region between the two polymorphisms. The co-inherited polymorphic 
alleles are said to be in linkage disequilibrium with one another because, in a given human 
population, they tend to either both occur together or else not occur at all in any particular 
member of the population. Indeed, where multiple polymorphisms in a given chromosomal 
region are found to be in linkage disequilibrium with one another, they define a quasi-stable 
genetic "haplotype." In contrast, recombination events occurring between two polymorphic 
loci cause them to become separated onto distinct homologous chromosomes. If meiotic 
recombination between two physically linked polymorphisms occurs frequently enough, the 
two polymorphisms will appear to segregate independently and are said to be in linkage 
equilibrium. 



generally proportional to the physical distance between them on the chromosome, the 
occurrence of "hot spots" as well as regions of repressed chromosomal recombination can 
result in discrepancies between the physical and recombinational distance between two 
markers. Thus, in certain chromosomal regions, multiple polymorphic loci spanning a broad 
chromosomal domain may be in linkage disequilibrium with one another, and thereby define 
a broad-spanning genetic haplotype. Furthermore, where a disease-causing mutation is found 
within or in linkage with this haplotype, one or more polymorphic alleles of the haplotype 
can be used as a diagnostic or prognostic indicator of the likelihood of developing the 
disease. This association between otherwise benign polymorphisms and a disease-causing 
polymorphism occurs if the disease mutation arose in the recent past, so that sufficient time 
has not elapsed for equilibrium to be achieved through recombination events. Therefore 
identification of a human haplotype which spans or is linked to a disease-causing mutational 
change, serves as a predictive measure of an individual's likelihood of having inherited that 
disease-causing mutation. Importantly, such prognostic or diagnostic procedures can be 
utilized without necessitating the identification and isolation of the actual disease-causing 
lesion. This is significant because the precise determination of the molecular defect involved 
in a disease process can be difficult and laborious, especially in the case of multifactorial 
diseases such as inflammatory disorders. 

Indeed, the statistical correlation between an inflammatory disorder and an IL- 
1 polymorphism does not necessarily indicate that the polymorphism directly causes the 
disorder. Rather the correlated polymorphism may be a benign allelic variant which is linked 



While the frequency of meiotic recombination between two markers is 
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to (i.e. in linkage disequilibrium with) a disorder-causing mutation which has occurred in the 
recent human evolutionary past, so that sufficient time has not elapsed for equilibrium to be 
achieved through recombination events in the intervening chromosomal segment. Thus, for 
the purposes of diagnostic and prognostic assays for a particular disease, detection of a 
polymorphic allele associated with that disease can be utilized without consideration of 
whether the polymorphism is directly involved in the etiology of the disease. Furthermore, 
where a given benign polymorphic locus is in linkage disequilibrium with an apparent 
disease-causing polymorphic locus, still other polymorphic loci which are in linkage 
disequilibrium with the benign polymorphic locus are also likely to be in linkage 
disequilibrium with the disease-causing polymorphic locus. Thus these other polymorphic 
loci will also be prognostic or diagnostic of the likelihood of having inherited the disease- 
causing polymorphic locus. Indeed, a broad-spanning human haplotype (describing the 
typical pattern of co-inheritance of alleles of a set of linked polymorphic markers) can be 
targeted for diagnostic purposes once an association has been drawn between a particular 
disease or condition and a corresponding human haplotype. Thus, the determination of an 
individual's likelihood for developing a particular disease of condition can be made by 
characterizing one or more disease-associated polymorphic alleles (or even one or more 
disease-associated haplotypes) without necessarily determining or characterizing the 
causative genetic variation. 

2, Summary of the Invention 

In one aspect, the present invention provides novel methods and kits for 
determining whether a subject has or is predisposed to developing a disease or condition that 
is associated with an IL-1 polymorphism. In one embodiment, the method comprises 
determining whether the subject's nucleic acids contain a marker or allele comprising an IL-1 
inflammatory haplotype. Individuals with the 33221461 haplotype are typically over- 
producers of both IL1 genes and proteins, upon stimulation. In contrast, individuals with the 
441 12332 haplotype are typically under-producers of IL-lra. This second pattern, 441 12332 
may actually be pro-inflammatory in local tissues but anti-inflammatory in the systemic 
response, so that the 441 12332 pattern is associated with lower levels of IL-lra in certain 
local tissues but higher serum levels of IL-lra. 

An allele comprising an IL-1 inflammatory haplotype can be detected by any 
of a variety of available techniques, including: 1) performing a hybridization reaction 



-4- 



WO 01/00880 



PCT/US00/18318 



between a nucleic acid sample and a probe that is capable of hybridizing to the allele; 2) 
sequencing at least a portion of the allele; or 3) determining the electrophoretic mobility of 
the allele or fragments thereof (e.g., fragments generated by endonuclease digestion). The 
allele can optionally be subjected to an amplification step prior to performance of the 
detection step. Preferred amplification methods arc selected from the group consisting of: the 
polymerase chain reaction (PCR), the ligase chain reaction (LCR), strand displacement 
amplification (SDA), cloning, and variations of the above (e.g. RT-PCR and allele specific 
amplification). Oligonucleotides necessary for amplification may be selected, for example, 
from within the IL-1 gene loci, either flanking the marker of interest (as required for PCR 
amplification) or directly overlapping the marker (as in ASO hybridization). In a particularly 
preferred embodiment, the sample is hybridized with a set of primers, which hybridize 5' and 
3' in a sense or antisense sequence to the vascular disease associated allele, and is subjected to 
a PCR amplification. 

An allele comprising an IL-1 inflammatory haplotype may also be detected 
indirectly, e.g. by analyzing the protein product encoded by the DNA. For example, where 
the marker in question results in the translation of a mutant protein, the protein can be 
detected by any of a variety of protein detection methods. Such methods include 
immunodetection and biochemical tests, such as size fractionation, where the protein has a 
change in apparent molecular weight either through truncation, elongation, altered folding or 
altered post-translational modifications. 

In another aspect, the invention features kits for performing the above- 
described assays. The kit can include a nucleic acid sample collection means and a means for 
determining whether a subject carries at least one allele comprising an IL-1 inflammatory 
haplotype. The kit may also contain a control sample either positive or negative or a standard 
anaVor an algorithmic device for assessing the results and additional reagents and components 
including: DNA amplification reagents, DNA polymerase, nucleic acid amplification 
reagents, restrictive enzymes, buffers, a nucleic acid sampling device, DNA purification 
device, deoxynucleotides, oligonucleotides (e.g. probes and primers) etc.. 

As described above, the control may be a positive or negative control. 
Further, the control sample may contain the positive (or negative) products of the allele 
detection technique employed. For example, where the allele detection technique is PCR 
amplification, followed by size fractionation, the control sample may comprise DNA 
fragments of the appropriate size. Likewise, where the allele detection technique involves 
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detection of a mutated protein, the control sample may comprise a sample of mutated protein. 
However, it is preferred that the control sample comprises the material to be tested. For 
example, the controls may be a sample of genomic DNA or a cloned portion of the IL-1 gene 
cluster. Preferably, however, the control sample is a highly purified sample of genomic 
DNA where the sample to be tested is genomic DNA. 

The oligonucleotides present in said kit may be used for amplification of the 
region of interest or for direct allele specific oligonucleotide (ASO) hybridization to the 
markers in question. Thus, the oligonucleotides may either flank the marker of interest (as 
required for PCR amplification) or directly overlap the marker (as in ASO hybridization). 

Information obtained using the assays and kits described herein (alone or in 
conjunction with information on another genetic defect or environmental factor, which 
contributes to the disease or condition that is associated with an IL-1 inflammatory 
haplotype) is useful for determining whether a non-symptomatic subject has or is likely to 
develop the particular disease or condition. In addition, the information can allow a more 
customized approach to preventing the onset or progression of the disease or condition. For 
example, this information can enable a clinician to more effectively prescribe a therapy that 
will address the molecular basis of the disease or condition. 

In yet a further aspect, the invention features methods for treating or 
preventing the development of a disease or condition that is associated with an IL-1 
inflammatory haplotype in a subject by administering to the subject an appropriate 
therapeutic of the invention. In still another aspect, the invention provides in vitro or in vivo 
assays for screening test compounds to identify therapeutics for treating or preventing the 
development of a disease or condition that is associated with an IL-1 inflammatory haplotype. 
In one embodiment, the assay comprises contacting a cell transfected with a causative 
mutation that is operably linked to an appropriate promoter with a test compound and 
determining the level of expression of a protein in the cell in the presence and in the absence 
of the test compound. In a preferred embodiment, the causative mutation results in decreased 
production of IL-1 receptor antagonist, and increased production of the IL-1 receptor 
antagonist in the presence of the test compound indicates that the compound is an agonist of 
IL-1 receptor antagonist activity. In another preferred embodiment, the causative mutation 
results in increased production of IL-1 a or IL-1 (5 , and decreased production of IL-1 a or IL- 
1(3 in the presence of the test compound indicates that the compound is an antagonist of IL-1 a 
or IL-1 P activity. In another embodiment, the invention features transgenic non-human 
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animals and their use in identifying antagonists of IL-la or IL-lp activity or agonists of IL- 
lRa activity. 

Other embodiments and advantages of the invention are set forth in the 
following detailed description and claims. 

3. Brief Description of the Figures 

FIG. 1 is a schematic depiction of the IL-1 gene cluster including a few 
polymorphic markers. 

FIG. 2 is a graph which plots the correlation between disequilibrium values 

and 

physical distance as described herein. 
FIG. 3 shows the nucleic acid sequence for IL-1 A (GEN X03833; 
SEQ ID No. 1). 

FIG. 4 shows the nucleic acid sequence for IL-1B (GEN X04500; 
SEQ ID No. 2). 

FIG. 5 shows the nucleic acid sequence for the secreted IL-1RN (GEN 

X64532; 

SEQ ID No. 3). 

4. Detailed Description of the Invention 

4.1 Definitions 

For convenience, the meaning of certain terms and phrases employed in the 
specification, examples, and appended claims is provided below. 

The term "allele" refers to the different sequence variants found at different 
polymorphic regions. For example, IL-1RN (VNTR) has at least five different alleles. The 
sequence variants may be single or multiple base changes, including without limitation 
insertions, deletions, or substitutions, or may be a variable number of sequence repeats. 

The term "allelic pattern" refers to the identity of an allele or alleles at one or 
more polymorphic regions. For example, an allelic pattern may consist of a single allele at a 
polymorphic she, as for IL-1RN (VNTR) allele 1, which is an allelic pattern having at least 
one copy of IL-1 RN allele 1 at the VNTR of the IL-1RN gene loci. Alternatively, an allelic 
pattern may consist of either a homozygous or heterozygous state at a single polymorphic 
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site. For example, IL1-RN (VNTR) allele 2,2 is an allelic pattern in which there are two 
copies of the second allele at the VNTR marker of IL-tRN that corresponds to the 
homozygous IL-RN (VNTR) allele 2 state. Alternatively, an allelic pattern may consist of 
the identity of alleles at more than one polymorphic site. 

The term "antibody " as used herein is intended to refer to a binding agent 
including a whole antibody or a binding fragment thereof which is specifically reactive with 
an IL-1 polypeptide. Antibodies can be fragmented using conventional techniques and the 
fragments screened for utility in the same manner as described above for whole antibodies. 
For example, F(ab)2 fragments can be generated by treating an antibody with pepsin. The 
resulting F(ab)2 fragment can be treated to reduce disulfide bridges to produce Fab 
fragments. The antibody of the present invention is further intended to include bispecific, 
single-chain, and chimeric and humanized molecules having affinity for an IL-1 B 
polypeptide conferred by at least one CDR region of the antibody. 

"Biological activity" or "bioactivity" or "activity" or "biological function", 
which are used interchangeably, for the purposes herein means an effector or antigenic 
function that is directly or indirectly performed by an IL-1 polypeptide (whether in its native 
or denatured conformation), or by any subsequence thereof. Biological activities include 
binding to a target peptide, e.g., an IL-1 receptor. An IL-1 bioactivity can be modulated by 
directly affecting an IL-1 polypeptide. Alternatively, an IL-1 bioactivity can be modulated by 
modulating the level of an IL-l polypeptide, such as by modulating expression of an IL-l 
gene. 

As used herein the term "bioactive fragment of an IL-1 polypeptide" refers to a 
fragment of a full-length IL-1 polypeptide, wherein the fragment specifically mimics or 
antagonizes the activity of a wild-type IL-1 polypeptide. The bioactive fragment preferably 
is a fragment capable of interacting with an interleukin receptor. 

The term "an aberrant activity", as applied to an activity of a polypeptide such 
as IL-1, refers to an activity which differs from the activity of the wild-type or native 
polypeptide or which differs from the activity of the polypeptide in a healthy subject. An 
activity of a polypeptide can be aberrant because it is stronger than the activity of its native 
counterpart. Alternatively, an activity can be aberrant because it is weaker or absent relative 
to the activity of its native counterpart. An aberrant activity can also be a change in an 
activity. For example an aberrant polypeptide can interact with a different target peptide. A 
cell can have an aberrant IL-1 activity due to overcxpression or underexpression of an IL-1 
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locus gene encoding an IL-1 locus polypeptide. 

"Cells", "host cells" or "recombinant host cells" are terms used 
interchangeably herein to refer not only to the particular subject cell, but to the progeny or 
potential progeny of such a cell. Because certain modifications may occur in succeeding 
generations due to either mutation or environmental influences, such progeny may not, in fact 
be identical to the parent cell, but are still included within the scope of the term as used 
herein. 

A "chimera," "mosaic," "chimeric mammal" and the like, refers to a 
transgenic mammal with a knock-out or knock-in construct in at least some of its genome- 
containing cells. 

The terms "control" or "control sample" refer to any sample appropriate to the 
detection technique employed. The control sample may contain the products of the allele 
detection technique employed or the material to be tested. Further, the controls may be 
positive or negative controls. By way of example, where the allele detection technique is 
PCR amplification, followed by size fractionation, the control sample may comprise DNA 
fragments of an appropriate size. Likewise, where the allele detection technique involves 
detection of a mutated protein, the control sample may comprise a sample of a mutant 
protein. However, it is preferred that the control sample comprises the material to be tested. 
For example, the controls may be a sample of genomic DNA or a cloned portion of the IL-1 
gene cluster. However, where the sample to be tested is genomic DNA, the control sample is 
preferably a highly purified sample of genomic DNA. 

The phrase "diseases and conditions associated with IL-1 polymorphisms" 
refers to a variety of diseases or conditions, the susceptibility to which can be indicated in a 
subject based on the identification of one or more alleles within the IL-1 complex. Examples 
include: inflammatory or degenerative disease, including: Systemic Inflammatory Response 
(SIRS); Alzheimer's Disease (and associated conditions and symptoms including; chronic 
neuroinflammation, glial activation; increased microglia; neuritic plaque formation; and 
response to therapy); Amylotropic Lateral Sclerosis (ALS), arthritis (and associated 
conditions and symptoms including: acute joint inflammation, antigen-induced arthritis, 
arthritis associated with chronic lymphocytic thyroiditis, collagen-induced arthitis, juvenile 
chronic arthritis; juvenile rheumatoid arthritis, osteoarthritis, prognosis and 
streptococcus-induced arthritis), asthma (and associated conditions and symptoms, including: 
bronchial asthma; chronic obstructive airway disease; chronic obstructive pulmonary disease, 
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juvenile asthma and occupational asthma); cardiovascular diseases (and associated conditions 
and symptoms, including atherosclerosis; autoimmune myocarditis, chronic cardiac hypoxia, 
congestive heart failure, coronary artery disease, cardiomyopathy and cardiac cell 
dysfunction, including: aortic smooth muscle cell activation; cardiac cell apoptosis; and 
immunomodulation of cardiac cell function; diabetes and associated conditions and 
symptoms, including autoimmune diabetes, insulin-dependent (Type 1) diabetes, diabetic 
periodontitis, diabetic retinopathy, and diabetic nephropathy); gastrointestinal inflammations 
(and related conditions and symptoms, including celiac disease, associated osteopenia, 
chronic colitis, Crohn's disease, inflammatory bowel disease and ulcerative colitis); gastric 
ulcers; hepatic inflammations, cholesterol gallstones and hepatic fibrosis, HIV infection (and 
associated conditions and symptoms, including degenerative responses, neurodegenerative 
responses, and HIV associated Hodgkin's Disease), Kawasaki's Syndrome (and associated 
diseases and conditions, including mucocutaneous lymph node syndrome, cervical 
lymphadenopathy, coronary artery lesions, edema, fever, increased leukocytes, mild anemia, 
skin peeling, rash, conjunctiva redness, thrombocytosis; multiple sclerosis, nephropathies 
(and associated diseases and conditions, including diabetic nephropathy, endstage renal 
disease, glomerulonephritis, Goodpasture's syndrome, hemodialysis survival and renal 
ischemic reperfusion injury), neurodegenerative diseases (and associated diseases and 
conditions, including acute neurodegeneration, induction of IL-1 in aging and 
neurodegenerative disease, IL-1 induced plasticity of hypothalamic neurons and chronic 
stress hyperresponsiveness), Ophthalmopathies (and associated diseases and conditions, 
including diabetic retinopathy, Graves' Ophthalmopathy, and uveitis, osteoporosis (and 
associated diseases and conditions, including alveolar, femoral, radial, vertebral or wrist bone 
loss or fracture incidence, postmenopausal bone loss, mass, fracture incidence or rate of bone 
loss), otitis media (adult or pediatric), pancreatis or pancreatic acinitis, periodontal disease 
(and associated diseases and conditions, including adult, early onset and diabetic); pulmonary 
diseases, including chronic lung disease, chronic sinusitis, hyaline membrane disease, 
hypoxia and pulmonary disease in SIDS; restenosis; rheumatism including rheumatoid 
arthritis , rheumatic aschoff bodies, rheumatic diseases and rheumatic myocarditis; thyroiditis 
including chronic lymphocytic thyroiditis;urinary tract infections including chronic 
prostatitis, chronic pelvic pain syndrome and urolithiasis. Immunological disorders, 
including autoimmune diseases, such as alopecia aerata, autoimmune myocarditis, Graves' 
disease, Graves ophthalmopathy, lichen sclerosis, multiple sclerosis, psoriasis, systemic lupus 
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erythematosus, systemic sclerosis, thyroid diseases (e.g.goiter and struma lymphomatosa 
(Hashimoto's thyroiditis, lymphadenoid goiter), sleep disorders and chronic fatigue syndrome 
and obesity (non-diabetic or associated with diabetes). Resistance to infectious diseases, such 
as Leishmaniasis, Leprosy, Lyme Disease, Lyme Carditis, malaria, cerebral malaria, 
meningititis, tubulointestitial nephritis associated with malaria), which are caused by bacteria, 
viruses (e.g. cytomegalovirus, encephalitis, Epstein-Barr Virus, Human Immunodeficiency 
Virus, Influenza Virus) or protozoans (e.g., Plasmodium falciparum, trypanosomes). 
Response to trauma, including cerebral trauma (including strokes and ischemias, encephalitis, 
encephalopathies, epilepsy, perinatal brain injury, prolonged febrile seizures, SIDS and 
subarachnoid hemorrhage), low birth weight (e.g. cerebral palsy), lung injury (acute 
hemorrhagic lung injury, Goodpasture's syndrome, acute ischemic reperfusion), myocardial 
dysfunction, caused by occupational and environmental pollutants (e.g. susceptibility to toxic 
oil syndrome silicosis), radiation trauma, and efficiency of wound healing responses (e.g. 
bum or thermal wounds, chronic wounds, surgical wounds and spinal cord injuries). 
Susceptibility to neoplasias, including breast cancer associated osteolytic metastasis, 
cachexia, colorectal cancer, hyperproliferative diseases, Hodgkin's disease, leukemias, 
lymphomas, metabolic diseases and tumors, metastases, myeolomas, and various cancers 
(including breast prostate ovarian, colon, lung, etc), anorexia and cachexia. Hormonal 
regulation including fertility/fecundity, likelihood of a pregnancy, incidence of preterm labor, 
prenatal and neonatal complications including preterm low birth weight, cerebral palsy, 
septicemia, hypothyroxinemia, oxygen dependence, cranial abnormality, early onset 
menopause. A subject's response to transplant (rejection or acceptance), acute phase response 
(e.g. febrile response), general inflammatory response, acute respiratory distress response, 
acute systemic inflammatory response, wound healing, adhesion, immunoinflammatory 
response, neuroendocrine response, fever development and resistance, acute-phase response, 
stress response, disease susceptibility, repetitive motion stress, tennis elbow, and pain 
management and response. 

The phrases "disruption of the gene" and "targeted disruption" or any similar 
phrase refers to the site specific interruption of a native DNA sequence so as to prevent 
expression of that gene in the cell as compared to the wild-type copy of the gene. The 
interruption may be caused by deletions, insertions or modifications to the gene, or any 
combination thereof. 

The term "haplotype" as used herein is intended to refer to a set of alleles that 
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are inherited together as a group (axe in linkage disequilibrium) at statistically significant 
levels (p corT < 0.05). As used herein, the phrase "an IL-1 haplotype" refers to a haplotype in 
the IL-1 loci. An IL-1 inflammatory or proinflammatory haplotype refers to a haplotype that 
is indicative of increased agonist and/or decreased antagonist activities. 

The terms "IL-1 gene cluster" and "IL-1 loci" as used herein include all the 
nucleic acid at or near the 2ql3 region of chromosome 2, including at least the IL-1 A, IL-1B 
and IL-1RN genes and any other linked sequences. (Nicklin et al., Genomics 19: 382-84, 
1994). The terms "IL-1 A", "IL-1B", and "IL-1RN" as used herein refer to the genes coding 
for IL-1 , IL-1 , and IL-1 receptor antagonist, respectively. The gene accession number for 
IL-1A, IL-1B, and IL-1RN are X03833, X04500, and X64532, respectively. 

"IL-1 functional mutation" refers to a mutation within the IL-1 gene cluster 
that results in an altered phenotype (i.e. effects the function of an IL-1 gene or protein). 
Examples include: IL-lA(+4845) allele 2, IL-1B ( + 3954) allele 2, IL-1B (+6912) allele 2 
and IL-1RN (+2018) allele 2. 

"IL-1X (Z) allele Y " refers to a particular allelic form, designated Y, 
occurring at an IL-1 locus polymorphic site in gene X, wherein X is IL-1 A, B, or RN and 
positioned at or near nucleotide Z, wherein nucleotide Z is numbered relative to the major 
transcriptional start site, which is nucleotide +1, of the particular IL-1 gene X. As further 
used herein, the term "IL-1X allele (Z)" refers to all alleles of an IL-1 polymorphic site in 
gene X positioned at or near nucleotide Z. For example, the term "IL-1RN (+2018) allele" 
refers to alternative forms of the IL-1RN gene at marker t-2018. "IL-1RN (^2018) allele 1" 
refers to a form of the IL-1RN gene which contains a cytosine (C) at position +2018 of the 
sense strand. Clay et al., Hum. Genet. 97:723-26, 1996. "IL-1RN (+2018) allele 2" refers to 
a form of the IL-1RN gene which contains a thymine (T) at position +2018 of the plus strand. 
When a subject has two identical IL-1RN alleles, the subject is said to be homozygous, or to 
have the homozygous state. When a subject has two different IL-1RN alleles, the subject is 
said to be heterozygous, or to have the heterozygous state. The term "IL-1RN (+2018) allele 
2,2" refers to the homozygous IL-1 RN (+2018) allele 2 state. Conversely, the term "IL-1RN 
(+2018) allele 1,1" refers to the homozygous IL-1 RN (+2018) allele 1 state. The term "1L- 
1RN (+2018) allele 1,2" refers to the heterozygous allele 1 and 2 state. 

"IL-1 related" as used herein is meant to include all genes related to the human 
IL-1 locus genes on human chromosome 2 (2q 12-14). These include IL-1 genes of the 
human IL-1 gene cluster located at chromosome 2 (2q 13-14) which include: the IL-1 A gene 
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which encodes interleukin-la, the IL-1B gene which encodes interleukin-l(3, and the IL-1RN 
(or IL-lra) gene which encodes the imerleukin-1 receptor antagonist. Furthermore these IL-1 
related genes include the type I and type II human IL-1 receptor genes located on human 
chromosome 2 (2ql2) and their mouse homologs located on mouse chromosome 1 at position 
19.5 cM. Interleukin-la, interleukin- ip, and interleukin-IRN are related in so much as they 
all bind to IL-1 type I receptors, however only interleukin-la and interleukin- 1 P are agonist 
ligands which activate IL-1 type I receptors, while interleukin-IRN is a naturally occurring 
antagonist ligand. Where the term "IL-1" is used in reference to a gene product or 
polypeptide, it is meant to refer to all gene products encoded by the interleukin- 1 locus on 
human chromosome 2 (2q 12-14) and their corresponding homologs from other species or 
functional variants thereof. The term IL-1 thus includes secreted polypeptides which promote 
an inflammatory response, such as IL-1 a and IL-lp, as well as a secreted polypeptide which 
antagonize inflammatory responses, such as IL-1 receptor antagonist and the IL-1 type II 
(decoy) receptor. 

An "IL-1 receptor 1 ' or "IL-IR" refers to various cell membrane bound protein 
receptors capable of binding to and/or transducing a signal from an IL-1 locus-encoded 
ligand. The term applies to any of the proteins which are capable of binding interleukin- 1 
(IL-1) molecules and, in their native configuration as mammalian plasma membrane proteins, 
presumably play a role in transducing the signal provided by IL-1 to a cell. As used herein, 
the term includes analogs of native proteins with IL-1 -binding or signal transducing activity. 
Examples include the human and murine IL-1 receptors described in U.S. Patent No. 
4,968,607. The term "IL-1 nucleic acid" refers to a nucleic acid encoding an IL-1 protein. 

An "IL-1 polypeptide" and "IL-1 protein'* are intended to encompass 
polypeptides comprising the amino acid sequence encoded by the IL-1 genomic DNA 
sequences shown in Figures 1, 2, and 3, or fragments thereof, and homologs thereof and 
include agonist and antagonist polypeptides. 

"Increased risk" refers to a statistically higher frequency of occurrence of the 
disease or condition in an individual carrying a particular polymorphic allele in comparison 
to the frequency of occurrence of the disease or condition in a member of a population that 
does not carry the particular polymorphic allele. 

The term "interact" as used herein is meant to include detectable relationships 
or associations (e.g. biochemical interactions) between molecules, such as interactions 
between protein-protein, protein-nucleic acid, nucleic acid-nucleic acid and protein-small 
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molecule or nucleic acid-small molecule in nature. 

The term "isolated" as used herein with respect to nucleic acids, such as DNA 
or RNA, refers to molecules separated from other DNAs, or RNAs, respectively, that arc 
present in the natural source of the macromolecule. For example, an isolated nucleic acid 
encoding one of the subject IL-1 polypeptides preferably includes no more than 10 kilobases 
(kb) of nucleic acid sequence which naturally immediately flanks the IL-1 gene in genomic 
DNA, more preferably no more than 5kb of such naturally occurring flanking sequences, and 
most preferably less than 1 .5kb of such naturally occurring flanking sequence. The term 
isolated as used herein also refers to a nucleic acid or peptide that is substantially free of 
cellular material, viral material, or culture medium when produced by recombinant DNA 
techniques, or chemical precursors or other chemicals when chemically synthesized. 
Moreover, an "isolated nucleic acid" is meant to include nucleic acid fragments which are not 
naturally occurring as fragments and would not be found in the natural state. The term 
"isolated" is also used herein to refer to polypeptides which are isolated from other cellular 
proteins and is meant to encompass both purified and recombinant polypeptides. 

A "knock-in" transgenic animal refers to an animal that has had a modified 
gene introduced into its genome and the modified gene can be of exogenous or endogenous 
origin. 

A "knock-out " transgenic animal refers to an animal in which there is partial 
or complete suppression of the expression of an endogenous gene (e.g, based on deletion of at 
least a portion of the gene, replacement of at least a portion of the gene with a second 
sequence, introduction of stop codons, the mutation of bases encoding critical amino acids, or 
the removal of an intron junction, etc.). 

A "knock-out construct" refers to a nucleic acid sequence that can be used to 
decrease or suppress expression of a protein encoded by endogenous DNA sequences in a 
cell. In a simple example, the knock-out construct is comprised of a gene, such as the IL-1RN 
gene, with a deletion in a critical portion of the gene, so that active protein cannot be 
expressed therefrom. Alternatively, a number of termination codons can be added to the 
native gene to cause early termination of the protein or an intron junction can be inactivated. 
In a typical knock-out construct, some portion of the gene is replaced with a selectable 
marker (such as the neo gene) so that the gene can be represented as follows: IL-1RN 5Vneo/ 
IL-1RN 3', where IL-IRN5' and IL-1RN 3', refer to genomic or cDNA sequences which are, 
respectively, upstream and downstream relative to a portion of the IL-1RN gene and where 
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neo refers to a neomycin resistance gene. In another knock-out construct, a second selectable 
marker is added in a flanking position so that the gene can be represented as: 
IL-lRN/neo/IL-lRN/TK, where TK is a thymidine kinase gene which can be added to either 
the IL-IRN5' or the IL-1RN3' sequence of the preceding construct and which further can be 
selected against (i.e. is a negative selectable marker) in appropriate media. This two-marker 
construct allows the selection of homologous recombination events, which removes the 
flanking TK marker, from non-homologous recombination events which typically retain the 
TK sequences. The gene deletion and/or replacement can be from the exons, introns, 
especially intron junctions, and/or the regulatory regions such as promoters. 

"Linkage disequilibrium" refers to co-inheritance of two alleles at 
frequencies greater than would be expected from the separate frequencies of occurrence of 
each allele in a given control population. The expected frequency of occurrence of two 
alleles that are inherited independently is the frequency of the first allele multiplied by the 
frequency of the second allele. Alleles that co-occur at expected frequencies are said to be 
in "linkage disequilibrium". The cause of linkage disequilibrium is often unclear. It can be 
due to selection for certain allele combinations or to recent admixture of genetically 
heterogeneous populations. In addition, in the case of markers that are very tightly linked 
to a disease gene, an association of an allele (or group of linked alleles) with the disease 
gene is expected if the disease mutation occurred in the recent past, so that sufficient time 
has not elapsed for equilibrium to be achieved through recombination events in the specific 
chromosomal region. When referring to allelic patterns that are comprised of more than 
one allele, a first allelic pattern is in linkage disequilibrium with a second allelic pattern if 
all the alleles that comprise the first allelic pattern are in linkage disequilibrium with at 
least one of the alleles of the second allelic pattern. An example of linkage disequilibrium 
is that which occurs between the alleles at the IL-1RN (+2018) and IL-1RN (VNTR) 
polymorphic sites. The two alleles at IL-1RN (+2018) are 100% in linkage disequilibrium 
with the two most frequent alleles of IL-1RN (VNTR), which are allele 1 and allele 2. 

The term " marker " refers to a sequence in the genome that is known to vary 
among individuals. For example, the IL-1RN gene has a marker that consists of a variable 
number of tandem repeats (VNTR). 

A "mutated gene" or "mutation" or "functional mutation" refers to an allelic 
form of a gene, which is capable of altering the phenotype of a subject having the mutated 
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gene relative to a subject which does not have the mutated gene. The altered phenotype 
caused by a mutation can be corrected or compensated for by certain agents. If a subject must 
be homozygous for this mutation to have an altered phenotype, the mutation is said to be 
recessive. If one copy of the mutated gene is sufficient to alter the phenotype of the subject, 
the mutation is said to be dominant. If a subject has one copy of the mutated gene and has a 
phenotype that is intermediate between that of a homozygous and that of a heterozygous 
subject (for that gene), the mutation is said to be co-dominant. 

A "non-human animal" of the invention includes mammals such as rodents, 
non-human primates, sheep, dogs, cows, goats, etc. amphibians, such a s members of the 
Xenopus genus, and transgenic avians (e.g. chickens, birds, etc.). The term "chimeric animal" 
is used herein to refer to animals in which the recombinant gene is found, or in which the 
recombinant gene is expressed in some but not all cells of the animal. The term "tissue- 
specific chimeric animal" indicates that one of the recombinant IL-1 genes is present and/or 
expressed or disrupted in some tissues but not others. The term "non-human mammal" refers 
to any member of the class Mammalia, except for humans. 

As used herein, the term "nucleic acid" refers to polynucleotides or 
oligonucleotides such as deoxyribonucleic acid (DNA), and, where appropriate, ribonucleic 
acid (RNA). The term should also be understood to include, as equivalents, analogs of 
either RNA or DNA made from nucleotide analogs (e.g. peptide nucleic acids) and as 
applicable to the embodiment being described, single (sense or antisense) and double- 
stranded polynucleotides. 

The term "polymorphism" refers to the coexistence of more than one form of a 
gene or portion (e.g., allelic variant) thereof. A portion of a gene of which there are at least 
two different forms, i.e., two different nucleotide sequences, is referred to as a "polymorphic 
region of a gene". A specific genetic sequence at a polymorphic region of a gene is an allele. 
A polymorphic region can be a single nucleotide, the identity of which differs in different 
alleles. A polymorphic region can also be several nucleotides long. 

The term "propensity to disease," also "predisposition" or "susceptibility" 
to disease or any similar phrase, means that certain alleles are hereby discovered to be 
associated with or predictive of a subject's incidence of developing a particular disease 
(e.g. a vascular disease). The alleles are thus over-represented in frequency in individuals 
with disease as compared to healthy individuals. Thus, these alleles can be used to predict 
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disease even in pre-symptomatic or pre-diseased individuals. 

"Small molecule" as used herein, is meant to refer to a composition, which has 
a molecular weight of less than about 5kD and most preferably less than about 4kD. Small 
molecules can be nucleic acids, peptides, peptidomimetics, carbohydrates, lipids or other 
organic or inorganic molecules. 

As used herein, the term "specifically hybridizes" or "specifically detects" 
refers to the ability of a nucleic acid molecule to hybridize to at least approximately 6 
consecutive nucleotides of a sample nucleic acid. 

'Transcriptional regulatory sequence" is a generic term used throughout the 
specification to refer to DNA sequences, such as initiation signals, enhancers, and promoters, 
which induce or control transcription of protein coding sequences with which they are 
operably linked. 

As used herein, the term "transgene" means a nucleic acid sequence (encoding, 
e.g., one of the IL-1 polypeptides, or an antisense transcript thereto) which has been 
introduced into a cell. A transgene could be partly or entirely heterologous, i.e., foreign, to 
the transgenic animal or cell into which it is introduced, or, is homologous to an endogenous 
gene of the transgenic animal or cell into which it is introduced, but which is designed to be 
inserted, or is inserted, into the animal's genome in such a way as to alter the genome of the 
cell into which it is inserted (e.g., it is inserted at a location which differs from that of the 
natural gene or its insertion results in a knockout). A transgene can also be present in a cell 
in the form of an episome. A transgene can include one or more transcriptional regulatory 
sequences and any other nucleic acid, such as introns, that may be necessary for optimal 
expression of a selected nucleic acid. 

A "transgenic animal" refers to any animal, preferably a non-human mammal, 
bird or an amphibian, in which one or more of the cells of the animal contain heterologous 
nucleic acid introduced by way of human intervention, such as by transgenic techniques well 
known in the art. The nucleic acid is introduced into the cell, directly or indirectly by 
introduction into a precursor of the cell, by way of deliberate genetic manipulation, such as 
by microinjection or by infection with a recombinant virus. The term genetic manipulation 
does not include classical cross-breeding, or in vitro fertilization, but rather is directed to the 
introduction of a recombinant DNA molecule. This molecule may be integrated within a 
chromosome, or it may be extrachromosomally replicating DNA. In the typical transgenic 
animals described herein, the transgene causes cells to express a recombinant form of one of 
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an IL-1 polypeptide, e.g. either agonistic or antagonistic forms. However, transgenic animals 
in which the recombinant gene is silent are also contemplated, as for example, the FLP or 
CRE recombinase dependent constructs described below. Moreover, "transgenic animal" also 
includes those recombinant animals in which gene disruption of one or more genes is caused 
by human intervention, including both recombination and antisense techniques. The term is 
intended to include all progeny generations. Thus, the founder animal and all Fl, F2, F3, and 
so on, progeny thereof are included. 

The term "treating" as used herein is intended to encompass curing as well as 
ameliorating at least one symptom of a condition or disease. 

The term "vector" refers to a nucleic acid molecule, which is capable of 
transporting another nucleic acid to which it has been linked. One type of preferred vector is 
an episome, i.e., a nucleic acid capable of extra-chromosomal replication. Preferred vectors 
are those capable of autonomous replication and/or expression of nucleic acids to which they 
are linked. Vectors capable of directing the expression of genes to which they are operatively 
linked are referred to herein as "expression vectors". In general, expression vectors of utility 
in recombinant DNA techniques are often in the form of "plasmids" which refer generally to 
circular double stranded DNA loops which, in their vector form are not bound to the 
chromosome. In the present specification, "plasmid" and "vector" are used interchangeably as 
the plasmid is the most commonly used form of vector. However, the invention is intended to 
include such other forms of expression vectors which serve equivalent functions and which 
become known in the art subsequently hereto. 

The term "wild-type allele" refers to an allele of a gene which, when present in 
two copies in a subject results in a wild-type phenotype. There can be several different wild- 
type alleles of a specific gene, since certain nucleotide changes in a gene may not affect the 
phenotype of a subject having two copies of the gene with the nucleotide changes. 

4.2 Predictive Medicine 

4.2. 1 . IL-1 Inflammatory Haplotypes and Their Association with Certain 
Diseases or Conditions. 

The present invention is based at least in part, on the identification of certain 
inflammatory haplotype patterns and the association (to a statistically significant extent) of 
these patterns with the development of certain diseases or conditions. Therefore, detection of 
the alleles comprising a haplotype, alone or in conjunction with another means in a subject 
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can indicate that the subject has or is predisposed to the development of a particular disease 
or condition. However, because these alleles are in linkage disequilibrium with other alleles, 
the detection of such other linked alleles can also indicate that the subject has or is 
predisposed to the development of a particular disease or condition. For example, the 
441 12332 haplotype comprises the following genotype: 



allele 4 of the 222/223 marker of IL-1A 

allele 4 of the gz5/gz6 marker of IL-1 A 

allele 1 of the -889 marker of IL-1 A 

allele 1 of the +3954 marker of IL-1B 

allele 2 of the -5 1 1 marker of IL- 1 B 

allele 3 of the gaat.p33330 marker 

allele 3 of the Y31 marker 

allele2of+2018ofIL-lRN 

allele 1 of +4845 of IL-1A 

allele 2 of the VNTR marker of IL-1 RN 

Three other polymorphisms in an IL-1RN alternative exon (Exon lie, which 
produces an intracellular form of the gene product) are also in linkage disequilibrium with 
allele 2 of IL-1RN (VNTR) (Clay et al., (1996) Hum Genet 97:723-26). These include: IL- 
1RN exon lie (1812) (GenBank:X77090 at 1812); the IL-1RN exon lie (1868) 
polymorphism (GenBank:X77090 at 1868); and the IL-1RN exon lie (1887) polymorphism 
(GenBank:X77090 at 1887). Furthermore yet another polymorphism in the promoter for the 
alternatively spliced intracellular form of the gene, the Pic (1731) polymorphism 
(GenBank:X77090 at 1731), is also in linkage disequilibrium with allele 2 of the IL-1RN 
(VNTR) polymorphic locus. For each of these polymorphic loci, the allele 2 sequence 
variant has been determined to be in linkage disequilibrium with allele 2 of the IL-1RN 
(VNTR) locus (Clay et al., (1996) Hum Genet 97:723-26). 
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The 33221461 haplotype comprises the following genotype: 



allele 3 of the 222/223 marker of IL-1 A 

allele 3 of the gz5/gz6 marker of IL-1 A 

allele 2 of the -889 marker of IL-1 A 

allele 2 of the +3954 marker of IL-1 B 

allele 1 of the -511 marker of IL-1B 

allele 4 of the gaat.p33330 marker 

allele 6 of the Y31 marker 

allele 1 of +2018 of IL-1 RN 

allele 2 of +4845 of IL-1 A 

allele 1 of the VNTR marker of IL-1 RN 

Individuals with the 441 12332 haplotype are typically overproducers of both 
IL-1 a and IL-lp proteins, upon stimulation. In contrast, individuals with the 33221461 
haplotype are typically underproduces of IL-1 ra. Each haplotype results in a net 
proinflammatory response. Each allele within a haplotype may have an effect, as well as a 
composite genotype effect. In addition, particular diseases may be associated with both 
haplotype patterns. 

The following Table 1 sets forth a number of genotype markers and various 
diseases and conditions to which these markers have been found to be associated to a 



statistically significant extent. 

TABLE 1 

Association Of IL-1 Haplotype Gene Markers With Certain Diseases 



GENOTYPE 


IL-1 A 


IL-1 A 


IL-1B 


IL-1B 


IL-1RN 




(-889) 


(+4845) 


(-511) 


(+3954) 


(+2018) 














DISEASE 












Periodontal Disease 


(*2) 


*2 




*2 




Coronary Artery Disease 






*2 




*2 


Atherosclerosis 












Osteoporosis 










*2 


Insulin dependent diabetes 












Diabetic retinopathy 










*1 


Endstage renal diseases 










(+) 


Diabetic nephropathy 










*2 


Hepatic fibrosis (Japanese 
alcoholics) 










(+) 


Alopecia areata 










♦2 



-20- 



WO 01/00880 



PCT/USOO/18318 



GENOTYPE 


IL-1A 


IL-1A 


IL-1B 


IL-1B 


IL-1RN 




(-889) 


(+4845) 


(-511) 


(+3954) 


(+2018) 


Graves' disease 










*2 


Graves' ophthalmopathy 










(-) 


Extrathyroid disease 










(+) 


Systemic Lupus 
Erythematosus 










*2 


Lichen Sclerosis 










*2 


Arthritis 










(+) 


Juvenile chronic arthritis 


*2 










Rheumatoid arthritis 










(+) 


Insulin dependent diabetes 








*2 


*2 VNTR 


Ulcerative colitis 










*2 


Asthma 






♦2 


*2 




Multiple sclerosis 








(*2) 


*2VNTR 


Menopause, early onset 










*2 



In addition to the allelic patterns described above, as described herein, one of 
skill in the an can readily identify other alleles (including polymorphisms and mutations) that 
are in linkage disequilibrium with an allele associated with a disease or disorder. For 
example, a nucleic acid sample from a first group of subjects without a particular disorder can 
be collected, as well as DNA from a second group of subjects with the disorder. The nucleic 
acid sample can then be compared to identify those alleles that are over-represented in the 
second group as compared with the first group, wherein such alleles are presumably 
associated with a disorder, which is caused or contributed to by inappropriate interleukin 1 
regulation. Alternatively, alleles that are in linkage disequilibrium with an allele that is 
associated with the disorder can be identified, for example, by genotyping a large population 
and performing statistical analysis to determine which alleles appear more commonly 
together than expected. Preferably the group is chosen to be comprised of genetically related 
individuals. Genetically related individuals include individuals from the same race, the same 
ethnic group, or even the same family. As the degree of genetic relatedness between a control 
group and a test group increases, so does the predictive value of polymorphic alleles which 
are ever more distantly linked to a disease-causing allele. This is because less evolutionary 
time has passed to allow polymorphisms which are linked along a chromosome in a founder 
population to redistribute through genetic cross-over events. Thus race-specific, ethnic- 
specific, and even family-specific diagnostic genotyping assays can be developed to allow for 
the detection of disease alleles which arose at ever more recent times in human evolution, 
e.g., after divergence of the major human races, after the separation of human populations 
into distinct ethnic groups, and even within the recent history of a particular family line. 

Linkage disequilibrium between two polymorphic markers or between one 
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polymorphic marker and a disease-causing mutation is a meta-stable state. Absent selective 
pressure or the sporadic linked reoccurrence of the underlying mutational events, the 
polymorphisms will eventually become disassociated by chromosomal recombination events 
and will thereby reach linkage equilibrium through the course of human evolution. Thus, the 
likelihood of finding a polymorphic allele in linkage disequilibrium with a disease or 
condition may increase with changes in at least two factors: decreasing physical distance 
between the polymorphic marker and the disease-causing mutation, and decreasing number of 
meiotic generations available for the dissociation of the linked pair. Consideration of the 
latter factor suggests that, the more closely related two individuals are, the more likely they 
will share a common parental chromosome or chromosomal region containing the linked 
polymorphisms and the less likely that this linked pair will have become unlinked through 
meiotic cross-over events occurring each generation. As a result, the more closely related 
two individuals are, the more likely it is that widely spaced polymorphisms may be co- 
inherited. Thus, for individuals related by common race, ethnicity or family, the reliability of 
ever more distantly spaced polymorphic loci can be relied upon as an indicator of inheritance 
of a linked disease-causing mutation. 

Appropriate probes may be designed to hybridize to a specific gene of the IL-1 
locus, such as IL-1 A, IL-1B or IL-1RN or a related gene. These genomic DNA sequences are 
shown in Figures 3, 4 and 5, respectively, and further correspond to SEQ ID Nos. 1 , 2 and 3, 
respectively. Alternatively, these probes may incorporate other regions of the relevant 
genomic locus, including intergenic sequences. Indeed the IL-1 region of human 
chromosome 2 spans some 400,000 base pairs and, assuming an average of one single 
nucleotide polymorphism every 1,000 base pairs, includes some 400 SNPs loci alone. Yet 
other polymorphisms available for use with the immediate invention are obtainable from 
various public sources. For example, the human genome database collects intragenic SNPs, 
is searchable by sequence and currently contains approximately 2,700 entries 
(http://hgbase.interactiva.de). Also available is a human polymorphism database maintained 
by the Massachusetts Institute of Technology (MIT SNP database 

(http://www.genome.wi.mit.edu/SNP/human/index.html)). From such sources SNPs as well 
as other human polymorphisms may be found. 

For example, examination of the IL-1 region of the human genome in any one 
of these databases reveals that the IL-1 locus genes are flanked by a centromere proximal 
polymorphic marker designated microsatellite marker AFM220ze3 at 127.4 cM 
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(centiMorgans) (see GenBank Acc. No. Z 17008) and a distal polymorphic marker designated 
microsatellite anchor marker AFM087xal at 127.9 cM (see GenBank Acc. No. Z16545). 
These human polymorphic loci are both CA dinucleotide repeat microsatellite 
polymorphisms, and, as such, show a high degree of heterozygosity in human populations. 
For example, one allele of AFM220ze3 generates a 211 bp PCR amplification product with a 
5' primer of the sequence TGTACCTAAGCCCACCCTTTAGAGC (SEQ ID No. 4) and a 3' 
primer of the sequence TGGCCTCCAGAAACCTCCAA (SEQ ID No. 5). Furthermore, one 
allele of AFM087xal generates a 177 bp PCR amplification product with a 5' primer of the 
sequence GCTGATATTCTGGTGGGAAA (SEQ ID No. 6) and a 3' primer of the sequence 
GGC AAGAGCAAAACTCTGTC (SEQ ID No. 7). Equivalent primers corresponding to 
unique sequences occurring 5' and 3* to these human chromosome 2 CA dinucleotide repeat 
polymorphisms will be apparent to one of skill in the art. Reasonable equivalent primers 
include those which hybridize within about 1 kb of the designated primer, and which further 
are anywhere from about 17 bp to about 27 bp in length. A general guideline for designing 
primers for amplification of unique human chromosomal genomic sequences is that they 
possess a melting temperature of at least about 50°C, wherein an approximate melting 
temperature can be estimated using the formula T mel[ = [2 x (# of A or T) + 4 x (# of G or C)]. 

A number of other human polymorphic loci occur between these two CA 
dinucleotide repeat polymorphisms and provide additional targets for determination of a 
prognostic allele in a family or other group of genetically related individuals. For example, 
the National Center for Biotechnology Information web site 

(www.ncbi.nlm.nih.gov/genemap/) lists a number of polymorphism markers in the region of 
the IL-1 locus and provides guidance in designing appropriate primers for amplification and 
analysis of these markers. 

Accordingly, the nucleotide segments of the invention may be used for their 
ability to selectively form duplex molecules with complementary stretches of human 
chromosome 2 q 12-13 or cDNAs from that region or to provide primers for amplification of 
DNA or cDNA from this region. The design of appropriate probes for this purpose requires 
consideration of a number of factors. For example, fragments having a length of between 10, 
15, or 18 nucleotides to about 20, or to about 30 nucleotides, will find particular utility. 
Longer sequences, e.g., 40, 50, 80, 90, 100, even up to full length, are even more preferred 
for certain embodiments. Lengths of oligonucleotides of at least about 18 to 20 nucleotides 
are well accepted by those of skill in the art as sufficient to allow sufficiently specific 
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hybridization so as to be useful as a molecular probe. Furthermore, depending on the 
application envisioned, one will desire to employ varying conditions of hybridization to 
achieve varying degrees of selectivity of probe towards target sequence. For applications 
requiring high selectivity, one will typically desire to employ relatively stringent conditions 
to form the hybrids. For example, relatively low salt and/or high temperature conditions, 
such as provided by 0.02 M-0.15M NaCl at temperatures of about 50° C to about 70* C. Such 
selective conditions may tolerate little, if any, mismatch between the probe and the template 
or target strand. 

Other alleles or other indicia of a disorder can be detected or monitored in a 
subject in conjunction with detection of the alleles described above, for example, identifying 
vessel wall thickness (e.g. as measured by ultrasound), or whether the subject smokes, drinks 
is overweight, is under stress or exercises. 

4.2.2 Detection of Alleles 

Many methods are available for detecting specific alleles at human 
polymorphic loci. The preferred method for detecting a specific polymorphic allele will 
depend, in part, upon the molecular nature of the polymorphism. For example, the various 
allelic forms of the polymorphic locus may differ by a single base-pair of the DNA. Such 
single nucleotide polymorphisms (or SNPs) are major contributors to genetic variation, 
comprising some 80% of all known polymorphisms, and their density in the human genome 
is estimated to be on average 1 per 1,000 base pairs. SNPs are most frequently biallelic- 
occurring in only two different forms (although up to four different forms of an SNP, 
corresponding to the four different nucleotide bases occurring in DNA, are theoretically 
possible). Nevertheless, SNPs are mutationally more stable than other polymorphisms, 
making them suitable for association studies in which linkage disequilibrium between 
markers and an unknown variant is used to map disease-causing mutations. In addition, 
because SNPs typically have only two alleles, they can be genotyped by a simple plus/minus 
assay rather than a length measurement, making them more amenable to automation. 

A variety of methods are available for detecting the presence of a particular 
single nucleotide polymorphic allele in an individual. Advancements in this field have 
provided accurate, easy, and inexpensive large-scale SNP genotyping. Most recently, for 
example, several new techniques have been described including dynamic allele-specific 
hybridization (DASH), microplate array diagonal gel electrophoresis (MADGE), 
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pyrosequencing, oligonucleotide-specific ligation, the TaqMan system as well as various 
DNA "chip" technologies such as the Affymetrix SNP chips. These methods require 
amplification of the target genetic region, typically by PCR. Still other newly developed 
methods, based on the generation of small signal molecules by invasive cleavage followed by 
mass spectrometry or immobilized padlock probes and rolling-circle amplification, might 
eventually eliminate the need for PCR. Several of the methods known in the art for detecting 
specific single nucleotide polymorphisms are summarized below. The method of the present 
invention is understood to include all available methods. 

Several methods have been developed to facilitate analysis of single 
nucleotide polymorphisms. In one embodiment, the single base polymorphism can be 
detected by using a specialized exonuclease-resistant nucleotide, as disclosed, e.g., in 
Mundy, C. R. (U.S. Pat. No.4,656,127). According to the method, a primer complementary 
to the allelic sequence immediately 3' to the polymorphic site is permitted to hybridize to a 
target molecule obtained from a particular animal or human. If the polymorphic site on the 
target molecule contains a nucleotide that is complementary to the particular 
exonuclease-resistant nucleotide derivative present, then that derivative will be incorporated 
onto the end of the hybridized pnmer. Such incorporation renders the primer resistant to 
exonuclease, and thereby permits its detection. Since the identity of the exonuclease-resistant 
derivative of the sample is known, a finding that the primer has become resistant to 
exonucleases reveals that the nucleotide present in the polymorphic site of the target molecule 
was complementary to that of the nucleotide derivative used in the reaction. This method has 
the advantage that it does not require the determination of large amounts of extraneous 
sequence data. 

In another embodiment of the invention, a solution-based method is used for 
determining the identity of the nucleotide of a polymorphic site. Cohen, D. et al. (French 
Patent 2,650,840; PCT Appln. No, WO91/02087). As in the Mundy method of U.S. Pat. No. 
4,656,127, a primer is employed that is complementary to allelic sequences immediately 3' to 
a polymorphic site. The method determines the identity of the nucleotide of that site using 
labeled dideoxynucleotide derivatives, which, if complementary to the nucleotide of the 
polymorphic site will become incorporated onto the terminus of the primer. 

An alternative method, known as Genetic Bit Analysis or GBA ™ is described 
by Goelet, P. et al. (PCT Appln. No. 92/15712). The method of Goelet, P. et al. uses mixtures 
of labeled terminators and a pnmer that is complementary to the sequence 3' to a 
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polymorphic site. The labeled terminator that is incorporated is thus determined by, and 
complementary to, the nucleotide present in the polymorphic site of the target molecule being 
evaluated. In contrast to the method of Cohen et al. (French Patent 2,650,840; PCT Appln. 
No. W09 1/02087) the method of Goelet, P. et al. is preferably a heterogeneous phase assay, 
in which the primer or the target molecule is immobilized to a solid phase. 

Recently, several primer-guided nucleotide incorporation procedures for 
assaying polymorphic sites in DNA have been described (Komhcr, J, S. et al., Nucl. Acids. 
Res. 17:7779-7784 (1989); Sokolov, B. P., Nucl. Acids Res. 18:3671 (1990); Syvanen, A. 
-C, et al., Genomics 8:684-692 (1990); Kuppuswamy, M. N. et al., Proc. Natl. Acad. Sci. 
(U.S.A.) 88:1 143-1 147 (1991); Prezant, T. R. et al., Hum. Mutat. 1:159-164 (1992); 
Ugozzoli, L. et al., GATA 9:107-1 12 (1992); Nyren, P. et al., Anal. Biochem. 208:171-175 
(1993)). These methods differ from GBA ™ in that they all rely on the incorporation of 
labeled deoxynucleotides to discriminate between bases at a polymorphic site. In such a 
format, since the signal is proportional to the number of deoxynucleotides incorporated, 
polymorphisms that occur in runs of the same nucleotide can result in signals that are 
proportional to the length of the run (Syvanen, A. -C, et al., Amer. J. Hum. Genet. 52:46-59 
(1993)). 

For mutations that produce premature termination of protein translation, the 
protein truncation test (PIT) offers an efficient diagnostic approach (Roest, et. al., (1993) 
Hum. Mol. Genet. 2:1719-21; van der Luijt, et. al., (1994) Genomics 20:1-4). For PTT, RNA 
is initially isolated from available tissue and reverse-transcribed, and the segment of interest 
is amplified by PCR. The products of reverse transcription PCR are then used as a template 
for nested PCR amplification with a primer that contains an RNA polymerase promoter and a 
sequence for initiating eukaryotic translation. After amplification of the region of interest, 
the unique motifs incorporated into the primer permit sequential in vitro transcription and 
translation of the PCR products. Upon sodium dodecyl sulfate-polyacryl amide gel 
electrophoresis of translation products, the appearance of truncated polypeptides signals the 
presence of a mutation that causes premature termination of translation. In a variation of this 
technique, DNA (as opposed to RNA) is used as a PCR template when the target region of 
interest is derived from a single exon. 

Any cell type or tissue may be utilized to obtain nucleic acid samples for use 
in the diagnostics described herein. In a preferred embodiment, the DNA sample is obtained 
from a bodily fluid, e.g, blood, obtained by known techniques (e.g. venipuncture) or saliva. 
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Alternatively, nucleic acid tests can be performed on dry samples (e.g. hair or skin). When 
using RNA or protein, the cells or tissues that may be utilized must express an IL-1 gene. 

Diagnostic procedures may also be performed in situ directly upon tissue 
sections (fixed and/or frozen) of patient tissue obtained from biopsies or resections, such that 
no nucleic acid purification is necessary. Nucleic acid reagents may be used as probes and/or 
primers for such in situ procedures (see, for example, Nuovo, G.J., 1992, PCR in situ 
hybridization: protocols and applications, Raven Press, NY), 

In addition to methods which focus primarily on the detection of one nucleic 
acid sequence, profiles may also be assessed in such detection schemes. Fingerprint profiles 
may be generated, for example, by utilizing a differential display procedure, Northern 
analysis and/or RT-PCR. 

A preferred detection method is allele specific hybridization using probes 
overlapping a region of at least one allele of an IL-1 proinflammatory haplotype and having 
about 5, 10, 20, 25, or 30 nucleotides around the mutation or polymorphic region. In a 
preferred embodiment of the invention, several probes capable of hybridizing specifically to 
other allelic variants involved in a restenosis are attached to a solid phase support, e.g., a 
"chip" (which can hold up to about 250,000 oligonucleotides). Oligonucleotides can be 
bound to a solid support by a variety of processes, including lithography. Mutation detection 
analysis using these chips comprising oligonucleotides, also termed "DNA probe arrays" is 
described e.g., in Cronin et al. (1996) Human Mutation 7:244. In one embodiment, a chip 
comprises all the allelic variants of at least one polymorphic region of a gene. The solid 
phase support is then contacted with a test nucleic acid and hybridization to the specific 
probes is detected. Accordingly, the identity of numerous allelic variants of one or more 
genes can be identified in a simple hybridization experiment. 

These techniques may also comprise the step of amplifying the nucleic acid 
before analysis. Amplification techniques are known to those of skill in the art and include, 
but are not limited to cloning, polymerase chain reaction (PCR), polymerase chain reaction of 
specific alleles (ASA), ligase chain reaction (LCR), nested polymerase chain reaction, self 
sustained sequence replication (Guatelli, J.C. et al,, 1990, Proc. Natl. Acad. Sci. USA 
87:1874-1878), transcriptional amplification system (Kwoh, D.Y. et al., 1989, Proc. Natl. 
Acad. Sci. USA 86:1 173-1 177), and Q- Beta Repiicase (Lizardi, P.M. et al., 1988, 
Bio/Technology 6:1 197). 

Amplification products may be assayed in a variety of ways, including size 
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analysis, restriction digestion followed by size analysis, detecting specific tagged 
oligonucleotide primers in the reaction products, allele-specific oligonucleotide (ASO) 
hybridization, allele specific 5' exonuclease detection, sequencing, hybridization, and the like. 

PCR based detection means can include multiplex amplification of a plurality 
of markers simultaneously. For example, it is well known in the art to select PCR primers to 
generate PCR products that do not overlap in size and can be analyzed simultaneously. 
Alternatively, it is possible to amplify different markers with primers that are differentially 
labeled and thus can each be differentially detected. Of course, hybridization based detection 
means allow the differential detection of multiple PCR products in a sample. Other 
techniques are known in the art to allow multiplex analyses of a plurality of markers. 

In a merely illustrative embodiment, the method includes the steps of (i) 
collecting a sample of cells from a patient, (ii) isolating nucleic acid (e.g., genomic, mRNA 
or both) from the cells of the sample, (iii) contacting the nucleic acid sample with one or 
more primers which specifically hybridize 5' and 3' to at least one allele of an IL-1 
proinflammatory haplotype under conditions such that hybridization and amplification of the 
allele occurs, and (iv) detecting the amplification product. These detection schemes are 
especially useful for the detection of nucleic acid molecules if such molecules are present in 
very low numbers. 

In a preferred embodiment of the subject assay, the allele of an IL-1 
proinflammatory haplotype is identified by alterations in restriction enzyme cleavage 
patterns. For example, sample and control DNA is isolated, amplified (optionally), digested 
with one or more restriction endonucleases, and fragment length sizes are determined by gel 
electrophoresis. 

In yet another embodiment, any of a variety of sequencing reactions known in 
the art can be used to directly sequence the allele. Exemplary sequencing reactions include 
those based on techniques developed by Maxim and Gilbert ((1977) Proc. Natl Acad Sci USA 
74:560) or Sanger (Sanger et al (1977) Proc. Nat. Acad. Sci USA 74:5463). It is also 
contemplated that any of a variety of automated sequencing procedures may be utilized when 
performing the subject assays (see, for example Biotechniques (1995) 19:448), including 
sequencing by mass spectrometry (see, for example PCT publication WO 94/16101; Cohen et 
al. (1996) Adv Chromatogr 36:127-162; and Griffin et al. (1993) Appl Biochem Biotechnol 
38:147-159). It will be evident to one of skill in the art that, for certain embodiments, the 
occurrence of only one, two or three of the nucleic acid bases need be determined in the 
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sequencing reaction. For instance, A-track or the like, e.g., where only one nucleic acid is 
detected, can be carried out, 

In a further embodiment, protection from cleavage agents (such as a nuclease, 
hydroxylamine or osmium tetroxide and with piperidine) can be used to detect mismatched 
bases in RNA/RNA or RNA/DNA or DNA/DNA heteroduplexes (Myers, et al. (1985) 
Science 230:1242). In general, the art technique of "mismatch cleavage" starts by providing 
heteroduplexes formed by hybridizing (labeled) RNA or DNA containing the wild-type allele 
with the sample. The double- stranded duplexes are treated with an agent which cleaves 
single-stranded regions of the duplex such as which will exist due to base pair mismatches 
between the control and sample strands. For instance, RNA/DNA duplexes can be treated 
with RNase and DNA/DNA hybrids treated with SI nuclease to enzymatically digest the 
mismatched regions. In other embodiments, either DNA/DNA or RNA/DNA duplexes can 
be treated with hydroxylamine or osmium tetroxide and with piperidine in order to digest 
mismatched regions. After digestion of the mismatched regions, the resulting material is then 
separated by size on denaturing poly aery lamide gels to determine the site of mutation. See, 
for example, Cotton et al (1988) Proc. Natl Acad Sci USA 85:4397; and Saleeba et al (1992) 
Methods Enzymol. 217:286-295. In a preferred embodiment, the control DNA or RNA can 
be labeled for detection. 

In still another embodiment, the mismatch cleavage reaction employs one or 
more proteins that recognize mismatched base pairs in double-stranded DNA (so called 
"DNA mismatch repair" enzymes). For example, the mutY enzyme of E. coli cleaves A at 
G/A mismatches and the thymidine DNA glycosylase from HeLa cells cleaves T at G/T 
mismatches (Hsu et al. (1994) Carcinogenesis 15:1657-1662). According to an exemplary 
embodiment, a probe based on an allele of an IL-1 locus haplotype is hybridized to a cDNA 
or other DNA product from a test cell(s). The duplex is treated with a DNA mismatch repair 
enzyme, and the cleavage products, if any, can be detected from electrophoresis protocols or 
the like. See, for example, U.S. Patent No. 5,459,039. 

In other embodiments, alterations in electrophoretic mobility will be used to 
identify an IL-1 locus allele. For example, single strand conformation polymorphism (SSCP) 
may be used to detect differences in electrophoretic mobility between mutant and wild type 
nucleic acids (Orita et al. (1989) Proc Natl. Acad. Sci USA 86:2766, see also Cotton (1993) 
Mutat Res 285:125-144; and Hayashi (1992) Genet Anal Tech Appl 9:73-79). Single- 
stranded DNA fragments of sample and control IL-1 locus alleles are denatured and allowed 
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to renature. The secondary structure of single-stranded nucleic acids varies according to 
sequence, the resulting alteration in electrophoretic mobility enables the detection of even a 
single base change. The DNA fragments may be labeled or detected with labeled probes. The 
sensitivity of the assay may be enhanced by using RNA (rather than DNA), in which the 
secondary structure is more sensitive to a change in sequence. In a preferred embodiment, 
the subject method utilizes heteroduplex analysis to separate double stranded heteroduplex 
molecules on the basis of changes in electrophoretic mobility (Keen et al. (1991) Trends 
Genet 7:5). 

In yet another embodiment, the movement of alleles in polyacrylamide gels 
containing a gradient of denaturant is assayed using denaturing gradient gel electrophoresis 
(DGGE) (Myers et al. (1985) Nature 313:495). When DGGE is used as the method of 
analysis, DNA will be modified to insure that it does not completely denature, for example by 
adding a GC clamp of approximately 40 bp of high-melting GC-rich DNA by PCR. In a 
further embodiment, a temperature gradient is used in place of a denaturing agent gradient to 
identify differences in the mobility of control and sample DNA (Rosenbaum and Reissner 
(1987) Biophys Chem 265:12753), 

Examples of other techniques for detecting alleles include, but are not limited 
to, selective oligonucleotide hybridization, selective amplification, or selective primer 
extension. For example, oligonucleotide primers may be prepared in which the known 
mutation or nucleotide difference (e.g., in allelic variants) is placed centrally and then 
hybridized to target DNA under conditions which permit hybridization only if a perfect match 
is found (Saiki et al (1986) Nature 324:163); Saiki et al (1989) Proc. Natl Acad. Sci USA 
86:6230), Such allele specific oligonucleotide hybridization techniques may be used to test 
one mutation or polymorphic region per reaction when oligonucleotides are hybridized to 
PCR amplified target DNA or a number of different mutations or polymorphic regions when 
the oligonucleotides are attached to the hybridizing membrane and hybridized with labelled 
target DNA. 

Alternatively, allele specific amplification technology which depends on 
selective PCR amplification may be used in conjunction with the instant invention. 
Oligonucleotides used as primers for specific amplification may carry the mutation or 
polymorphic region of interest in the center of the molecule (so that amplification depends on 
differential hybridization) (Gibbs et al (1989) Nucleic Acids Res. 17:2437-2448) or at the 
extreme 3' end of one primer where, under appropriate conditions, mismatch can prevent, or 
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reduce polymerase extension (Prossner (1993) Tibtech 1 1:238. In addition it may be 
desirable to introduce a novel restriction site in the region of the mutation to create cleavage- 
based detection (Gasparini et al (1992) Mol. Cell Probes 6:1). It is anticipated that in certain 
embodiments amplification may also be performed using Taq ligase for amplification 
(Barany (1991) Proc. Natl. Acad. Sci USA 88:189). In such cases, ligation will occur only if 
there is a perfect match at the 3' end of the 5' sequence making it possible to detect the 
presence of a known mutation at a specific site by looking for the presence or absence of 
amplification. 

In another embodiment, identification of the allelic variant is carried out using 
an oligonucleotide ligation assay (OLA), as described, e.g., in U.S. Pat. No. 4,998,617 and in 
Landegren, U. et al. ((1988) Science 241:1077-1080). The OLA protocol uses two 
oligonucleotides which are designed to be capable of hybridizing to abutting sequences of a 
single strand of a target. One of the oligonucleotides is linked to a separation marker, e.g.. 
biotinylated, and the other is detectably labeled. If the precise complementary sequence is 
found in a target molecule, the oligonucleotides will hybridize such that their termini abut, 
and create a ligation substrate. Ligation then permits the labeled oligonucleotide to be 
recovered using avidin, or another biotin ligand. Nickerson, D. A. et al. have described a 
nucleic acid detection assay that combines attributes of PCR and OLA (Nickerson, D. A. et 
al. (1990) Proc. Natl. Acad. Sci. USA 87:8923-27). In this method, PCR is used to achieve 
the exponential amplification of target DNA, which is then detected using OLA. 

Several techniques based on this OLA method have been developed and can 
be used to detect alleles of an 1L-1 locus haplotype. For example, U.S. Patent No. 5,593,826 
discloses an OLA using an oligonucleotide having 3'-amino group and a 5'-phosphorylated 
oligonucleotide to form a conjugate having a phospho rami date linkage. In another variation 
of OLA described in Tobe et al. ((1996) Nucleic Acids Res 24: 3728), OLA combined with 
PCR permits typing of two alleles in a single microtiter well. By marking each of the 
allele-specific primers with a unique hapten, i.e. digoxigenin and fluorescein, each OLA 
reaction can be detected by using hapten specific antibodies that are labeled with different 
enzyme reporters, alkaline phosphatase or horseradish peroxidase. This system permits the 
detection of the two alleles using a high throughput format that leads to the production of two 
different colors. 

Another embodiment of the invention is directed to kits for detecting a 
predisposition for developing a restenosis. This kit may contain one or more 
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oligonucleotides, including 5' and 3' oligonucleotides that hybridize 5' and 3' to at least one 
allele of an IL-1 locus haplotype. PCR amplification oligonucleotides should hybridize 
between 25 and 2500 base pairs apart, preferably between about 100 and about 500 bases 
apart, in order to produce a PCR product of convenient size for subsequent analysis. 

Particularly preferred primers for use in the diagnostic method of the invention 
include SEQ ID Nos. 1-6 

The design of additional oligonucleotides for use in the amplification and 
detection of IL-1 polymorphic alleles by the method of the invention is facilitated by the 
availability of both updated sequence information from human chromosome 2ql3 - which 
contains the human IL-1 locus, and updated human polymorphism information available for 
this locus. For example, the DNA sequence for the IL-1 A, IL-1B and IL-1RN is shown in 
Figures 1 (GenBank Accession No. X03833), 2 (GenBank Accession No. X04500) and 3 
(GenBank Accession No. X64532) respectively. Suitable primers for the detection of a 
human polymorphism in these genes can be readily designed using this sequence information 
and standard techniques known in the art for the design and optimization of primers 
sequences. Optimal design of such primer sequences can be achieved, for example, by the 
use of commercially available primer selection programs such as Primer 2.1, Primer 3 or 
GeneFisher (See also, Nicklin M.HJ., Weith A. Duff G.W., "A Physical Map of the Region 
Encompassing the Human Interleukin-la, interleukin-1 p, and Interleukin-1 Receptor 
Antagonist Genes" Genomics 19: 382 (1995); Nothwang H.G., et al. "Molecular Cloning of 
the Interleukin-1 gene Cluster: Construction of an Integrated YAC/PAC Contig and a partial 
transcriptional Map in the Region of Chromosome 2ql3" Genomics 41 : 370 (1997); Clark, et 
al. (1986) Nucl. Acids. Res., 14:7897-7914 [published erratum appears in Nucleic Acids Res., 
15:868 (1987) and the Genome Database (GDB) project at the URL http://www.gdb.org). 

For use in a kit, oligonucleotides may be any of a variety of natural and/or 
synthetic compositions such as synthetic oligonucleotides, restriction fragments, cDNAs, 
synthetic peptide nucleic acids (PNAs), and the like. The assay kit and method may also 
employ labeled oligonucleotides to allow ease of identification in the assays. Examples of 
labels which may be employed include radio-labels, enzymes, fluorescent compounds, 
streptavidin, avidin, biotin, magnetic moieties, metal binding moieties, antigen or antibody 
moieties, and the like. 

The kit may, optionally, also include DNA sampling means. DNA sampling 
means are well known to one of skill in the art and can include, but not be limited to 
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substrates, such as filter papers, the AmpliCard™ (University of Sheffield, Sheffield, 
England S10 2JF; Tarlow, JW, et ai, J. of Invest. Dermatol. 103:387-389 (1994)) and the 
like; DNA purification reagents such as Nucleon™ kits, lysis buffers, proteinase solutions 
and the like; PCR reagents, such as I Ox reaction buffers, thermostable polymerase, dNTPs, 
and the like; and allele detection means such as the Hinfi restriction enzyme, allele specific 
oligonucleotides, degenerate oligonucleotide primers for nested PCR from dried blood. 

4.2.3. Pharmacogenomics 

Knowledge of the particular alleles associated with a susceptibility to 
developing a particular disease or condition, alone or in conjunction with information on 
other genetic defects contributing to the particular disease or condition allows a 
customization of the prevention or treatment in accordance with the individual's genetic 
profile, the goal of "pharmacogenomics". Thus, comparison of an individual's IL-1 profile to 
the population profile for a vascular disorder, permits the selection or design of drugs or other 
therapeutic regimens that are expected to be safe and efficacious for a particular patient or 
patient population (i.e., a group of patients having the same genetic alteration). 

In addition, the ability to target populations expected to show the highest 
clinical benefit, based on genetic profile can enable: 1) the repositioning of already marketed 
drugs; 2) the rescue of drug candidates whose clinical development has been discontinued as 
a result of safety or efficacy limitations, which are patient subgroup-specific; and 3) an 
accelerated and less costly development for candidate therapeutics and more optimal drug 
labeling (e.g. since measuring the effect of various doses of an agent on the causative 
mutation is useful for optimizing effective dose). 

The treatment of an individual with a particular therapeutic can be monitored 
by determining protein (e.g. IL-1 a, IL-lp, or IL-IRa), mRNA and/or transcriptional level. 
Depending on the level detected, the therapeutic regimen can then be maintained or adjusted 
(increased or decreased in dose). In a preferred embodiment, the effectiveness of treating a 
subject with an agent comprises the steps of: (i) obtaining a preadministration sample from a 
subject prior to administration of the agent; (ii) detecting the level or amount of a protein, 
mRNA or genomic DNA in the preadministration sample; (iii) obtaining one or more post- 
administration samples from the subject; (iv) detecting the level of expression or activity of 
the protein, mRNA or genomic DNA in the post-administration sample; (v) comparing the 
level of expression or activity of the protein, mRNA or genomic DNA in the 
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preadministration sample with the corresponding protein, mRNA or genomic DNA in the 
postadministration sample, respectively; and (vi) altering the administration of the agent to 
the subject accordingly. 

Cells of a subject may also be obtained before and after administration of a 
therapeutic to detect the level of expression of genes other than an IL-1 gene to verify that the 
therapeutic does not increase or decrease the expression of genes which could be deleterious. 
This can be done, e.g., by using the method of transcriptional profiling. Thus, mRNA from 
cells exposed in vivo to a therapeutic and mRNA from the same type of cells that were not 
exposed to the therapeutic could be reverse transcribed and hybridized to a chip containing 
DNA from numerous genes, to thereby compare the expression of genes in cells treated and 
not treated with the therapeutic. 



or haplotype refers to any agent or therapeutic regimen (including pharmaceuticals, 
nutraceuticals and surgical means) that prevents or postpones the development of or alleviates 
the symptoms of the particular disease or condition in the subject. The therapeutic can be a 
polypeptide, peptidomimetic, nucleic acid or other inorganic or organic molecule, preferably 
a "small molecule" including vitamins, minerals and other nutrients. Preferably the 
therapeutic can modulate at least one activity of an IL-1 polypeptide, e.g., interaction with a 
receptor, by mimicking or potentiating (agonizing) or inhibiting (antagonizing) the effects of 
a naturally-occurring polypeptide. An agonist can be a wild-type protein or derivative 
thereof having at least one bioactivity of the wild-type, e.g., receptor binding activity. An 
agonist can also be a compound that upregulates expression of a gene or which increases at 
least one bioactivity of a protein. An agonist can also be a compound which increases the 
interaction of a polypeptide with another molecule, e.g., a receptor. An antagonist can be a 
compound which inhibits or decreases the interaction between a protein and another 
molecule, e.g., a receptor or an agent that blocks signal transduction or post-translation 
processing (e.g., IL-1 converting enzyme (ICE) inhibitor). Accordingly, a preferred 
antagonist is a compound which inhibits or decreases binding to a receptor and thereby 
blocks subsequent activation of the receptor. An antagonist can also be a compound that 
downregulates expression of a gene or which reduces the amount of a protein present. The 
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antagonist can be a dominant negative form of a polypeptide, e.g., a form of a polypeptide 
which is capable of interacting with a target peptide, e.g., a receptor, but which does not 
promote the activation of the receptor. The antagonist can also be a nucleic acid encoding a 
dominant negative form of a polypeptide, an antisense nucleic acid, or a ribozyme capable of 
interacting specifically with an RNA. Yet other antagonists are molecules which bind to a 
polypeptide and inhibit its action. Such molecules include peptides, e.g., forms of target 
peptides which do not have biological activity, and which inhibit binding to receptors. Thus, 
such peptides will bind to the active site of a protein and prevent it from interacting with 
target peptides. Yet other antagonists include antibodies that specifically interact with an 
epitope of a molecule, such that binding interferes with the biological function of the 
polypeptide. In yet another preferred embodiment, the antagonist is a small molecule, such as 
a molecule capable of inhibiting the interaction between a polypeptide and a target receptor. 
Alternatively, the small molecule can function as an antagonist by interacting with sites other 
than the receptor binding site. 

Modulators of IL-1 (e.g. IL-1 a, IL-lp or IL-1 receptor antagonist) or a protein 
encoded by a gene that is in linkage disequilibrium with an IL-1 gene can comprise any type 
of compound, including a protein, peptide, peptidomimetic, small molecule, or nucleic acid. 
Preferred agonists include nucleic acids (e.g. encoding an IL-1 protein or a gene that is up- or 
down-regulated by an IL-1 protein), proteins (e.g. IL-1 proteins or a protein that is up- or 
down-regulated thereby) or a small molecule (e.g. that regulates expression or binding of an 
IL-1 protein). Preferred antagonists, which can be identified, for example, using the assays 
described herein, include nucleic acids (e.g. single (antisense) or double stranded (triplex) 
DNA or PNA and ribozymes), protein (e.g. antibodies) and small molecules that act to 
suppress or inhibit IL-1 transcription and/or protein activity. 

4.3.1. Effective Dose 

Toxicity and therapeutic efficacy of such compounds can be determined by 
standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for 
determining The LD50 (the dose lethal to 50% of the population) and the Edso (the dose 
therapeutically effective in 50% of the population). The dose ratio between toxic and 
therapeutic effects is the therapeutic index and it can be expressed as the ratio LD50/ED50. 
Compounds which exhibit large therapeutic indices are preferred. While compounds that 
exhibit toxic side effects may be used, care should be taken to design a delivery system that 
targets such compounds to the site of affected tissues in order to minimize potential damage 
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to uninfected cells and, thereby, reduce side effects. 

The data obtained from the cell culture assays and animal studies can be used 
in formulating a range of dosage for use in humans. The dosage of such compounds lies 
preferably within a range of circulating concentrations that include the ED50 with little or no 
toxicity. The dosage may vary within this range depending upon the dosage form employed 
and the route of administration utilized. For any compound used in the method of the 
invention, the therapeutically effective dose can be estimated initially from cell culture 
assays. A dose may be formulated in animal models to achieve a circulating plasma 
concentration range that includes the IC50 (i.e., the concentration of the test compound which 
achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such 
information can be used to more accurately determine useful doses in humans. Levels in 
plasma may be measured, for example, by high performance liquid chromatography. 

4.3.2. Formulation and Use 

Compositions for use in accordance with the present invention may be 
formulated in a conventional manner using one or more physiologically acceptable carriers or 
excipients. Thus, the compounds and their physiologically acceptable salts and solvates may 
be formulated for administration by, for example, injection, inhalation or insufflation (either 
through the mouth or the nose) or oral, buccal, parenteral or rectal administration. 

For such therapy, the compounds of the invention can be formulated for a 
variety of loads of administration, including systemic and topical or localized administration. 
Techniques and formulations generally may be found in Remmington's Pharmaceutical 
Sciences, Meade Publishing Co., Easton, PA. For systemic administration, injection is 
preferred, including intramuscular, intravenous, intraperitoneal, and subcutaneous. For 
injection, the compounds of the invention can be formulated in liquid solutions, preferably in 
physiologically compatible buffers such as Hank's solution or Ringer's solution. In addition, 
the compounds may be formulated in solid form and redissolved or suspended immediately 
prior to use. Lyophilized forms are also included. 

For oral administration, the compositions may take the form of, for example, 
tablets or capsules prepared by conventional means with pharmaceutically acceptable 
excipients such as binding agents (e.g., pregelatinised maize starch, polyvinylpyrrolidone or 
hydroxypropyl methylcellulose); fillers (e.g., lactose, microcrystalline cellulose or calcium 
hydrogen phosphate); lubricants (e.g., magnesium stearate, talc or silica); disintegrants (e.g., 
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potato starch or sodium starch glycolatc); or wetting agents (e.g., sodium lauryl sulfate). The 
tablets may be coated by methods well known in the art. Liquid preparations for oral 
administration may take the form of, for example, solutions, syrups or suspensions, or they 
may be presented as a dry product for constitution with water or other suitable vehicle before 
use. Such liquid preparations may be prepared by conventional means with pharmaceutically 
acceptable additives such as suspending agents (e.g., sorbitol syrup, cellulose derivatives or 
hydrogenated edible fats); emulsifying agents (e.g., lecithin or acacia); non- aqueous vehicles 
(e.g., ationd oil, oily esters, ethyl alcohol or fractionated vegetable oils); and preservatives 
(e.g., methyl or propyl-p-hydroxybenzoates or sorbic acid). The preparations may also 
contain buffer salts, flavoring, coloring and sweetening agents as appropriate. 

Preparations for oral administration may be suitably formulated to give 
controlled release of the active compound. For buccal administration the compositions may 
take the form of tablets or lozenges formulated in conventional manner. For administration 
by inhalation, the compounds for use according to the present invention are conveniently 
delivered in the form of an aerosol spray presentation from pressurized packs or a nebuliser, 
with the use of a suitable propellant, e.g., dichlorodifluoromethane, trichlorofluoromethane, 
dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In the case of a pressurized 
aerosol the dosage unit may be determined by providing a valve to deliver a metered amount. 
Capsules and cartridges of e.g., gelatin for use in an inhaler or insufflator may be formulated 
containing a powder mix of the compound and a suitable powder base such as lactose or 
starch. 

The compounds may be formulated for parenteral administration by injection, 
e.g., by bolus injection or continuous infusion. Formulations for injection may be presented 
in unit dosage form, e.g., in ampoules or in multi-dose containers, with an added preservative. 
The compositions may take such forms as suspensions, solutions or emulsions in oily or 
aqueous vehicles, and may contain formulating agents such as suspending, stabilizing and/or 
dispersing agents. Alternatively, the active ingredient may be in powder form for 
constitution with a suitable vehicle, e.g., sterile pyrogen-free water, before use. 

The compounds may also be formulated in rectal compositions such as 
suppositories or retention enemas, e.g., containing conventional suppository bases such as 
cocoa butter or other glycerides. 

In addition to the formulations described previously, the compounds may also 
be formulated as a depot preparation. Such long acting formulations may be administered by 
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implantation (for example subcutancously or intramuscularly) or by intramuscular injection. 
Thus, for example, the compounds may be formulated with suitable polymeric or 
hydrophobic materials (for example as an emulsion in an acceptable oil) or ion exchange 
resins, or as sparingly soluble derivatives, for example, as a sparingly soluble salt Other 
suitable delivery systems include microspheres which offer the possibility of local 
noninvasive delivery of drugs over an extended period of time. This technology utilizes 
microspheres of precapillary size which can be injected via a coronary catheter into any 
selected part of the e.g. heart or other organs without causing inflammation or ischemia. The 
administered therapeutic is slowly released from these microspheres and taken up by 
surrounding tissue cells (e.g. endothelial cells). 

Systemic administration can also be by transmucosal or transdermal means. 
For transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, and 
include, for example, for transmucosal administration bile salts and fusidic acid derivatives. 
In addition, detergents may be used to facilitate permeation. Transmucosal administration 
may be through nasal sprays or using suppositories. For topical administration, the oligomers 
of the invention are formulated into ointments, salves, gels, or creams as generally known in 
the art. A wash solution can be used locally to treat an injury or inflammation to accelerate 
healing. 

The compositions may, if desired, be presented in a pack or dispenser device 
which may contain one or more unit dosage forms containing the active ingredient. The 
pack may for example comprise metal or plastic foil, such as a blister pack. The pack or 
dispenser device may be accompanied by instructions for administration. 

4.4 Assays to Identify Therapeutics 

Based on the identification of mutations that cause or contribute to the 
development of a disease or disorder that is associated with an IL- 1 polymorphism or 
haplotype, the invention further features cell-based or cell free assays for identifying 
therapeutics. In one embodiment, a cell expressing an IL-1 receptor, or a receptor for a 
protein that is encoded by a gene which is in linkage disequilibrium with an IL-1 gene, on 
the outer surface of its cellular membrane is incubated in the presence of a test compound 
alone or in the presence of a test compound and another protein and the interaction between 
the test compound and the receptor or between the protein (preferably a tagged protein) and 
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the receptor is detected, e.g., by using a microphysiometer (McConnell et al. (1992) 
Science 257: 1906). An interaction between the receptor and either the test compound or the 
protein is detected by the microphysiometer as a change in the acidification of the medium. 
This assay system thus provides a means of identifying molecular antagonists which, for 
example, function by interfering with protein- receptor interactions, as well as molecular 
agonist which, for example, function by activating a receptor. 

Cellular or cell-free assays can also be used to identify compounds which 
modulate expression of an IL-1 gene or a gene in linkage disequilibrium therewith, 
modulate translation of an mRNA, or which modulate the stability of an mRNA or protein. 
Accordingly, in one embodiment, a cell which is capable of producing an IL-1, or other 
protein is incubated with a test compound and the amount of protein produced in the cell 
medium is measured and compared to that produced from a cell which has not been 
contacted with the test compound. The specificity of the compound vis a vis the protein 
can be confirmed by various control analysis, e.g., measuring the expression of one or 
more control genes. In particular, this assay can be used to determine the efficacy of 
antisense, ribozyme and triplex compounds. 

Cell-free assays can also be used to identify compounds which are capable of 
interacting with a protein, to thereby modify the activity of the protein. Such a compound 
can, e.g., modify the structure of a protein thereby effecting its ability to bind to a 
receptor. In a preferred embodiment, cell-free assays for identifying such compounds 
consist essentially in a reaction mixture containing a protein and a test compound or a 
library of test compounds in the presence or absence of a binding partner. A test 
compound can be, e.g., a derivative of a binding partner, e.g., a biologically inactive target 
peptide, or a small molecule. 

Accordingly, one exemplary screening assay of the present invention 
includes the steps of contacting a protein or functional fragment thereof with a test 
compound or library of test compounds and detecting the formation of complexes. For 
detection purposes, the molecule can be labeled with a specific marker and the test 
compound or library of test compounds labeled with a different marker. Interaction of a 
test compound with a protein or fragment thereof can then be detected by determining the 
level of the two labels after an incubation step and a washing step. The presence of two 
labels after the washing step is indicative of an interaction. 
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An interaction between molecules can also be identified by using real-time 
BIA (Biomolecular Interaction Analysis, Pharmacia Biosensor AB) which detects surface 
plasmon resonance (SPR), an optical phenomenon. Detection depends on changes in the 
mass concentration of macromolecules at the biospecific interface, and does not require any 
labeling of interactants. In one embodiment, a library of test compounds can be 
immobilized on a sensor surface, e.g., which forms one wall of a micro-flow cell. A 
solution containing the protein or functional fragment thereof is then flown continuously 
over the sensor surface. A change in the resonance angle as shown on a signal recording, 
indicates that an interaction has occurred. This technique is further described, e.g., in 
BIAtechnology Handbook by Pharmacia. 

Another exemplary screening assay of the present invention includes the steps 
of (a) forming a reaction mixture including: (i) an IL-1 or other protein, (ii) an appropriate 
receptor, and (iii) a test compound; and (b) detecting interaction of the protein and receptor. 
A statistically significant change (potentiation or inhibition) in the interaction of the protein 
and receptor in the presence of the test compound, relative to the interaction in the absence of 
the test compound, indicates a potential antagonist (inhibitor). The compounds of this assay 
can be contacted simultaneously. Alternatively, a protein can first be contacted with a test 
compound for an appropriate amount of time, following which the receptor is added to the 
reaction mixture. The efficacy of the compound can be assessed by generating dose response 
curves from data obtained using various concentrations of the test compound. Moreover, a 
control assay can also be performed to provide a baseline for comparison. 

Complex formation between a protein and receptor may be detected by a 
variety of techniques. Modulation of the formation of complexes can be quantitated using, 
for example, detectably labeled proteins such as radiolabeled, fluorescently labeled, or 
enzymatically labeled proteins or receptors, by immunoassay, or by chromatographic 
detection. 

Typically, it will be desirable to immobilize either the protein or the receptor 
to facilitate separation of complexes from uncomplexed forms of one or both of the proteins, 
as well as to accommodate automation of the assay. Binding of protein and receptor can be 
accomplished in any vessel suitable for containing the reactants. Examples include microtitre 
plates, test tubes, and micro-centrifuge tubes. In one embodiment, a fusion protein can be 
provided which adds a domain that allows the protein to be bound to a matrix. For example, 
glutathione-S-transferase fusion proteins can be adsorbed onto glutathione sepharose beads 
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(Sigma Chemical, St. Louis, MO) or glutathione derivatized microtitre plates, which are then 
combined with the receptor, e.g. an ^ 5 S-labeled receptor, and the test compound, and the 
mixture incubated under conditions conducive to complex formation, e.g. at physiological 
conditions for salt and pH, though slightly more stringent conditions may be desired. 
Following incubation, the beads are washed to remove any unbound label, and the matrix 
immobilized and radiolabel determined directly (e.g. beads placed in scintillant), or in the 
supernatant after the complexes are subsequently dissociated. Alternatively, the complexes 
can be dissociated from the matrix, separated by SDS-PAGE, and the level of protein or 
receptor found in the bead fraction quantitated from the gel using standard electrophoretic 
techniques such as described in the appended examples. Other techniques for immobilizing 
proteins on matrices are also available for use in the subject assay. For instance, either 
protein or receptor can be immobilized utilizing conjugation of biotin and streptavidin. 
Transgenic animals can also be made to identify agonists and antagonists or to confirm the 
safety and efficacy of a candidate therapeutic. Transgenic animals of the invention can 
include non-human animals containing a restenosis causative mutation under the control of an 
appropriate endogenous promoter or under the control of a heterologous promoter. 

The transgenic animals can also be animals containing a transgene, such as 
reporter gene, under the control of an appropriate promoter or fragment thereof. These 
animals are useful, e.g., for identifying drugs that modulate production of an IL-1 protein, 
such as by modulating gene expression. Methods for obtaining transgenic non-human 
animals are well known in the art. In preferred embodiments, the expression of the restenosis 
causative mutation is restricted to specific subsets of cells, tissues or developmental stages 
utilizing, for example, cis-acting sequences that control expression in the desired pattern. In 
the present invention, such mosaic expression of a protein can be essential for many forms of 
lineage analysis and can additionally provide a means to assess the effects of, for example, 
expression level which might grossly alter development in small patches of tissue within an 
otherwise normal embryo. Toward this end, tissue-specific regulatory sequences and 
conditional regulatory sequences can be used to control expression of the mutation in certain 
spatial patterns. Moreover, temporal patterns of expression can be provided by, for example, 
conditional recombination systems or prokaryotic transcriptional regulatory sequences. 
Genetic techniques, which allow for the expression of a mutation can be regulated via site- 
specific genetic manipulation in vivo, are known to those skilled in the art. 

The transgenic animals of the present invention all include within a plurality 
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of their cells a causative mutation transgene of the present invention, which transgene alters 
the phenotype of the "host cell". In an illustrative embodiment, either the creiloxP 
recombinase system of bacteriophage PI (Lakso et al. (1992) PNAS 89:6232-6236; Orban et 
at. (1992) PNAS 89:6861-6865) or the FLP recombinase system of Saccharomyces cerevisiae 
(O'Gorman et al. (1991) Science 251:1351-1355; PCT publication WO 92/15694) can be used 
to generate in vivo site-specific genetic recombination systems. Cre recombinase catalyzes 
the site-specific recombination of an intervening target sequence located between loxP 
sequences. loxP sequences are 34 base pair nucleotide repeat sequences to which the Cre 
recombinase binds and are required for Cre recombinase mediated genetic recombination. 
The orientation of loxP sequences determines whether the intervening target sequence is 
excised or inverted when Cre recombinase is present (Abremski et al. (1984) J. Biol. Chem. 
259:1509-1514); catalyzing the excision of the target sequence when the loxP sequences are 
oriented as direct repeats and catalyzes inversion of the target sequence when loxP sequences 
are oriented as inverted repeats. 

Accordingly, genetic recombination of the target sequence is dependent on 
expression of the Cre recombinase. Expression of the recombinase can be regulated by 
promoter elements which are subject to regulatory control, e.g., tissue-specific, 
developmental stage-specific, inducible or repressible by externally added agents. This 
regulated control will result in genetic recombination of the target sequence only in cells 
where recombinase expression is mediated by the promoter element. Thus, the activation of 
expression of the causative mutation transgene can be regulated via control of recombinase 
expression. 

Use of the creiloxP recombinase system to regulate expression of a causative 
mutation transgene requires the construction of a transgenic animal containing transgenes 
encoding both the Cre recombinase and the subject protein. Animals containing both the Cre 
recombinase and the restenosis causative mutation transgene can be provided through the 
construction of "double" transgenic animals. A convenient method for providing such 
animals is to mate two transgenic animals each containing a transgene. 

Similar conditional transgenes can be provided using prokaryotic promoter 
sequences which require prokaryotic proteins to be simultaneous expressed in order to 
facilitate expression of the transgene. Exemplary promoters and the corresponding trans- 
activating prokaryotic proteins are given in U.S. Patent No. 4,833,080. 

Moreover, expression of the conditional transgenes can be induced by gene 
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therapy-like methods wherein a gene encoding the transactivating protein, e.g. a recombinasc 
or a prokaryotic protein, is delivered to the tissue and caused to be expressed, such as in a 
cell-type specific manner. By this method, the transgene could remain silent into adulthood 
until "turned on" by the introduction of the transactivator. 

In an exemplary embodiment, the "transgenic non-human animals" of the 
invention are produced by introducing transgenes into the germline of the non-human animal. 
Embryonal target cells at various developmental stages can be used to introduce transgenes. 
Different methods are used depending on the stage of development of the embryonal target 
cell. The specific line(s) of any animal used to practice this invention are selected for general 
good health, good embryo yields, good pronuclear visibility in the embryo, and good 
reproductive fitness. In addition, the haplotype is a significant factor. For example, when 
transgenic mice are to be produced, strains such as C5 7BL/6 or FVB lines are often used 
(Jackson Laboratory, Bar Harbor, ME). Preferred strains are those with H-2 b , H-2 d or 
haplotypes such as C57BL/6 or DBA/1. The line(s) used to practice this invention may 
themselves be transgenics, and/or may be knockouts (i.e., obtained from animals which have 
one or more genes partially or completely suppressed) . 

In one embodiment, the transgene construct is introduced into a single stage 
embryo. The zygote is the best target for microinjection. In the mouse, the male 
pronucleus reaches the size of approximately 20 micrometers in diameter which allows 
reproducible injection of 1-2 pi of DNA solution. The use of zygotes as a target for gene 
transfer has a major advantage in that in most cases the injected DNA will be incorporated 
into the host gene before the first cleavage (Brinster et al. (1985) PNAS 82:4438-4442). As 
a consequence, all cells of the transgenic animal will carry the incorporated transgene. This 
will in general also be reflected in the efficient transmission of the transgene to offspring of 
the founder since 50% of the germ cells will harbor the transgene. 

Normally, fertilized embryos are incubated in suitable media until the 
pronuclei appear. At about this time, the nucleotide sequence comprising the transgene is 
introduced into the female or male pronucleus as described below. In some species such as 
mice, the male pronucleus is preferred. It is most preferred that the exogenous genetic 
material be added to the male DNA complement of the zygote prior to its being processed by 
the ovum nucleus or the zygote female pronucleus. It is thought that the ovum nucleus or 
female pronucleus release molecules which affect the male DNA complement, perhaps by 
replacing the protamines of the male DNA with histones. thereby facilitating the combination 
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of the female and male DNA complements to form the diploid zygote. Thus, it is preferred 
that the exogenous genetic material be added to the male complement of DNA or any other 
complement of DNA prior to its being affected by the female pronucleus. For example, the 
exogenous genetic material is added to the early male pronucleus, as soon as possible after 
the formation of the male pronucleus, which is when the male and female pronuclei are well 
separated and both are located close to the cell membrane. Alternatively, the exogenous 
genetic material could be added to the nucleus of the sperm after it has been induced to 
undergo decondensation. Sperm containing the exogenous genetic material can then be added 
to the ovum or the decondensed sperm could be added to the ovum with the transgene 
constructs being added as soon as possible thereafter. 

Introduction of the transgene nucleotide sequence into the embryo may be 
accomplished by any means known in the an such as, for example, microinjection, 
electroporation, or lipofection. Following introduction of the transgene nucleotide sequence 
into the embryo, the embryo may be incubated in vitro for varying amounts of time, or 
reimplanted into the surrogate host, or both. In vitro incubation to maturity is within the 
scope of this invention. One common method in to incubate the embryos in vitro for about 
1-7 days, depending on the species, and then reimplant them into the surrogate host. 

For the purposes of this invention a zygote is essentially the formation of a 
diploid cell which is capable of developing into a complete organism. Generally, the zygote 
will be comprised of an egg containing a nucleus formed, either naturally or artificially, by 
the fusion of two haploid nuclei from a gamete or gametes. Thus, the gamete nuclei must be 
ones which are naturally compatible, i.e., ones which result in a viable zygote capable of 
undergoing differentiation and developing into a functioning organism. Generally, a euploid 
zygote is preferred. If an aneuploid zygote is obtained, then the number of chromosomes 
should not vary by more than one with respect to the euploid number of the organism from 
which either gamete originated. 

In addition to similar biological considerations, physical ones also govern the 
amount (e.g., volume) of exogenous genetic material which can be added to the nucleus of 
the zygote or to the genetic material which forms a part of the zygote nucleus. If no genetic 
material is removed, then the amount of exogenous genetic material which can be added is 
limited by the amount which will be absorbed without being physically disruptive. Generally, 
the volume of exogenous genetic material inserted will not exceed about 10 picoliters. The 
physical effects of addition must not be so great as to physically destroy the viability of the 
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zygote. The biological limit of the number and variety of DNA sequences will vary 
depending upon the particular zygote and functions of the exogenous genetic material and 
will be readily apparent to one skilled in the art. because the genetic material, including the 
exogenous genetic material, of the resulting zygote must be biologically capable of initiating 
and maintaining the differentiation and development of the zygote into a functional organism. 

The number of copies of the transgene constructs which are added to the 
zygote is dependent upon the total amount of exogenous genetic material added and will be 
the amount which enables the genetic transformation to occur. Theoretically only one copy is 
required; however, generally, numerous copies are utilized, for example, 1,000-20,000 copies 
of the transgene construct, in order to insure that one copy is functional. As regards the 
present invention, there will often be an advantage to having more than one functioning copy 
of each of the inserted exogenous DNA sequences to enhance the phenotypic expression of 
the exogenous DNA sequences. 

Any technique which allows for the addition of the exogenous genetic material 
into nucleic genetic material can be utilized so long as it is not destructive to the cell, nuclear 
membrane or other existing cellular or genetic structures. The exogenous genetic material is 
preferentially inserted into the nucleic genetic material by microinjection. Microinjection of 
cells and cellular structures is known and is used in the art. 

Reimplantation is accomplished using standard methods. Usually, the 
surrogate host is anesthetized, and the embryos are inserted into the oviduct. The number of 
embryos implanted into a particular host will vary by species, but will usually be comparable 
to the number of off spring the species naturally produces. 

Transgenic offspring of the surrogate host may be screened for the presence 
and/or expression of the transgene by any suitable method. Screening is often accomplished 
by Southern blot or Northern blot analysis, using a probe that is complementary to at least a 
portion of the transgene. Western blot analysis using an antibody against the protein encoded 
by the transgene may be employed as an alternative or additional method for screening for the 
presence of the transgene product. Typically, DNA is prepared from tail tissue and analyzed 
by Southern analysis or PCR for the transgene. Alternatively, the tissues or cells believed to 
express the transgene at the highest levels are tested for the presence and expression of the 
transgene using Southern analysis or PCR, although any tissues or cell types may be used for 
this analysis. 

Alternative or additional methods for evaluating the presence of the transgene 
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include, without limitation, suitable biochemical assays such as enzyme and/or 
immunological assays, histological stains for particular marker or enzyme activities, flow 
cytometric analysis, and the like. Analysis of the blood may also be useful to detect the 
presence of the transgene product in the blood, as well as to evaluate the effect of the 
transgene on the levels of various types of blood cells and other blood constituents. 

Progeny of the transgenic animals may be obtained by mating the transgenic 
animal with a suitable partner, or by in vitro fertilization of eggs and/or sperm obtained from 
the transgenic animal. Where mating with a partner is to be performed, the partner may or 
may not be transgenic and/or a knockout; where it is transgenic, it may contain the same or a 
different transgene, or both. Alternatively, the partner may be a parental line. Where in vitro 
fertilization is used, the fertilized embryo may be implanted into a surrogate host or incubated 
in vitro, or both. Using either method, the progeny may be evaluated for the presence of the 
transgene using methods described above, or other appropriate methods. 

The transgenic animals produced in accordance with the present invention will 
include exogenous genetic material. Further, in such embodiments the sequence will be 
attached to a transcriptional control element, e.g., a promoter, which preferably allows the 
expression of the transgene product in a specific type of cell. 

Retroviral infection can also be used to introduce the transgene into a non- 
human animal. The developing non-human embryo can be cultured in vitro to the blastocyst 
stage. During this time, the blastomeres can be targets for retroviral infection (Jaenich. R. 
(1976) PNAS 73:1260-1264). Efficient infection of the blastomeres is obtained by enzymatic 
treatment to remove the zona pellucida {Manipulating the Mouse Embryo, Hogan eds. (Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor. 1986). The viral vector system used to 
introduce the transgene is typically a replication-defective retrovirus carrying the transgene 
(Jahner et al. (1985) PNAS 82:6927-6931 ; Van der Putten et al. (1985) PNAS 82:6148-6152). 
Transfection is easily and efficiently obtained by culturing the blastomeres on a monolayer of 
virus-producing cells (Van der Putten, supra; Stewart et al. (1987) EMBO J. 6:383-388). 
Alternatively, infection can be performed at a later stage. Virus or virus-producing cells can 
be injected into the blastocoele (Jahner et al. (1982) Nature 298:623-628). Most of the 
founders will be mosaic for the transgene since incorporation occurs only in a subset of the 
cells which formed the transgenic non-human animal. Further, the founder may contain 
various retroviral insertions of the transgene at different positions in the genome which 
generally will segregate in the offspring. In addition, it is also possible to introduce 
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transgenes into the germ line by intrauterine retroviral infection of the midgestation embryo 
(Jahner et al. (1982) supra). 

A third type of target cell for transgene introduction is the embryonal stem cell 
(ES). ES cells are obtained from pre-implantation embryos cultured in vitro and fused with 
embryos (Evans et al. (1981) Nature 292:154-156; Bradley et al. (1984) Nature 309:255-258; 
Gossler et al. (1986) PNAS 83: 9065-9069; and Robertson et al. (1986) Nature 322:445-448). 
Transgenes can be efficiently introduced into the ES cells by DNA transfection or by 
retrovirus-mediated transduction. Such transformed ES cells can thereafter be combined with 
blastocysts from a non-human animal. The ES cells thereafter colonize the embryo and 
contribute to the germ line of the resulting chimeric animal. For review see Jaenisch, R. 
( 1 988) Science 240: 1 468- 1 474. 

The present invention is further illustrated by the following examples which 
should not be construed as limiting in any way. The contents of all cited references 
(including literature references, issued patents, published patent applications as cited 
throughout this application) are hereby expressly incorporated by reference. The practice of 
the present invention will employ, unless otherwise indicated, conventional techniques that 
are within the skill of the art. Such techniques are explained fully in the literature. See, for 
example, Molecular Cloning A Laboratory Manual, (2nd ed., Sambrook, Fritsch and 
Maniatis, eds., Cold Spring Harbor Laboratory Press: 1989); DNA Cloning, Volumes I and II 
(D. N. Glover ed., 1985); Oligonucleotide Synthesis (M. J. Gait ed., 1984); U.S. Patent No. 
4,683,195; U.S. Patent No. 4,683,202; and Nucleic Acid Hybridization (B. D. Hames & S. J. 
Higgins eds., 1984). 
5. EXAMPLES 
Example 1 Genotyping 

All human subjects were unrelated, Caucasian, healthy blood donors from 
Sheffield (n = 112). Subjects were typed at the loci indicated in Table 1. 



Table 2. Markers Used in Haplotype Study 



Marker 


Gene 


Reference 


2221223 


ILIA 


Todd& Naylor, Nucleic Acids Res. 19: 3756(1991) 


gz5/gz6 


ILIA 


Zuliani, et al., Am. J. Hum. Genet. 46: 963-69 (1990) 


-889 


ILIA 


McDowell, etal, Arth. & Rheum. 38: 221 -8(1995) 


+ 3954 


IL1B 


di Giovine, et al, Cytokine 7(6): 606 (1995) 
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-511 


IL1B 


di Giovine, Hum, Molec. Genet. 1(6): 450 (1992) 


gaat.p33330 


between IL1B 
and IL1RN 


Murray, etal., Coop. Hum. Link. Center, unpublished 


Y31 


between IL1B 
and IL1RN 


Spurr, etaL, Cytogenet. & Cell Genet. 73: 255-73 (1996) 


VNTR 


IL1RN 


Tarlow, etaL, Hum. Genet. 91: 403-4 (1993) 



The primer sequences and fluorescent labels used in PCR amplification of 
markers were as in Table 3. 



Table 2. Primer Sequence and Flourescent Label for Genotyping 



Marker 


Label 


Primer Sequence 


2221223 


HEX 


ATGTATAGAATTCCATTCCTG (SEQ ID NO. 8) 




TA A A ATPA A TtTnTTr, A T^.T A f, /CCH Tn \TD Q\ 
1 A AAA I LAAu lul 1 LiA lul AO ^oty 1JJ INU. 7) 


gz51gz6 


FAM 


GGGA7TACAGGCGTGAGCCACCGCG (SEQ ID NO. 10) 




TTAGTATTGCTGGTAGTATTCATAT (SEQ ID NO. 11) 


-889 


NONE 


TGTTCTACCACCTGAACTAGG (SEQ ID NO. 12) 




TTACATATGAGCCTTCCATG (SEQ ID NO. 13) 


+3954 


NONE 


CTCAGGTGTCCTCGAAGAAATCAAA (SEQ ID NO. 14) 




GCTTTMGCTGTGAGTCCCG (SEQ ID NO. 15) 


-511 


NONE 


TGGCATTGATCTGGTTCATC (SEQ ID NO. 16) 






GTTTAGGAATCTTCCCACTT(SEO ID NO. 17) 


gaat.p33330 


FAM 


GAGGCGTGAGAATCTCAAGA (SEQ ID NO. 18) 




GTGTCCTCAAGTGGATCTGG (SEQ 10 NO. 19) 


Y31 


HEX 


GGGCAACAGAGCAATGTTTCT (SEQ ID NO. 20) 




CAGTGTGTCAGTGTACTGTT (SEQ ID NO. 21) 


VNTR 


NONE 


CTCAGCAACACTCCTAT (SEQ ID NO. 22) 




TCCTGGTCTGCAGGTAA (SEQ ID NO. 23) 



Reaction conditions were as described in Table 4. 



Table 4. Reaction Conditions 
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Marker 


Conditions 


222/223 


50 mM KCI. 10 mM Tris-HCI pH 9.0,1.5 mM MgCI 2 , 200:mM dNTPs, 25 
ng primers, 50 ng template, 0.004% W-l (Gibco-BRL) 0.2 u Taq, PCR was 
done at 30 cycles of 94°C for 1". 55 °C for 1", 72°C for 1" 


gz5/gz6 


as per marker 2221223, except 1 u of Perfect Match (StrataGene) was added 


-889 


per marker 222/223, except PCR was done for 1 cycle at 96°C for 1", 40 
cycles of 94°C for 1", 46°C for 1" 72°C for 1" and 1 cycle of 72°C for 4", 
products were cleaved with Ncol for analysis 


+ 3954 


as per marker 222/223, except PCR was done for 35 cycles with annealing at 
67.5 °C, products were cleaved with Taq 1 for analysis 


-511 


as per marker 2221223, except PCR was done for 1 cycle at 95 °C for 2", 35 
cycles or l ror 1 , jj l ror l /4 L. tor l and i cycle or /4 l tor 4 , 
products were cleaved with Aval and Bsu361 for analysis 


gaat.p33330 


per marker 222/223 


Y31 


per marker 222/223 


VNTR 


per marker 222/223 except with 1.7 mM MgCI,, 1 cycle at 96°C for 1"; 30 
cycles of 94°C for 1", 60°C for 1", 70°C for V and 1 cycle at 70°C for 2" 



2221223, gz5/gz6, gaat.p33330 and Y31 PCR products were examined by 
agarose gel electrophoresis and the remainder of the PCR products were pooled according to 
the intensity of ethidium bromide staining. 2 gl of the pool was analyzed on an ABI 373A 
automated sequencer and allele sizes were determined using the Genescan and Genotyper 
software. Alleles were globally binned using a simple computer program and numbered in 
order of size. 

-889 PCR products were digested with Ncol and the resulting fragments sized 
on 8% PAGE. Allele 1 produces 83 and 16 bp fragments. Allele 2 produces a 99 bp 
fragment. 

+3954 PCR products were digested with restriction enzyme Taq I. Allele 1 
produces fragments of 97, 85 and 12 bp, and allele 2 produces fragments of 182 and 12 bp. 

-511 PCR products were digested with Aval and Bsu36I and the fragments 
were sized by 8% PAGE. Allele 1 produces 190 and 1 14 bp fragments when digested with 
Aval and a 304 bp fragment when digested with Bsu36I. Allele 2 produces a 304 bp 
fragment when digested with Aval and 190 and 1 14 bp fragments when digested with 
Bsu36I. 

VNTR PCR products were sized by electrophoresis on 2% agarose gel at 90V 
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for 45 minutes. Allele 1 has 4 repeats and the PCR product is 412 bp, allele 2 has 2 repeats 
and the PCR product is 240 bp, allele 3 has 3 repeats and the PCR product is 326 bp, allele 4 
has 4 repeats and the PCR product is 498 bp, allele 5 has 6 repeats and the PCR product is 
584 bp. 

Intergenic distances were determined by estimation based on the insert sizes of 
relevant PAC clones from a contig spanning the IL-1 gene cluster (Nicklin, et ai. Genomics 
19:382-4 (1994)/ Intragenic distances were determined from the relevant nucleotide 
sequence obtained form the GENBANK database. 



Example 2 Method for Estimating Linkage Disequilibrium 

Because four of the markers studied herein are multiallelic, a preliminary 
analysis was carried out to determine which allelic combinations between pairs of loci 
contributed to the greatest disequilibrium, in order that the disequilibrium would not be 
masked when the alleles were grouped into biallelic systems. The E.H. program of Xie and 
Ott (Handbook of Human Genetic Link-age, 1994, John Hopkins University Press, 188-98), 
incorporated by reference herein, was used to estimate haplotype frequencies under H 0 (no 
linkage) and H, (allelic linkage allowed). It was found that the elaborate allele grouping, 
strategy had some advantages over commonly used methods, in that disequilibrium was 
detected between almost all pairwise combinations of markers examined and there was good 
correlation between disequilibrium and physical distance. 

More specifically, the E.H. program of Xie and Ott was used to determine 
maximum likelihood estimates of disequilibriumn (D } ) between each pairwise combination of 
alleles, where D l} = \\- - p^ are the frequencies for allele i at locus 1 and allele j at locus 2 
respectively, and hy is the frequency of the haplotype ij. The program calculated maximum 
likelihood values for the haplotype frequencies (and hence allele frequencies) under H 0 (no 
association) and haplotype frequencies under Hj (allelic association allowed). For markers 
with greater than two alleles, the E. H. estimate for allele frequencies correlated poorly with 
the allele frequencies as estimated directly from the sample population, and therefore gave no 
confidence to the Dg estimates given. It was therefore necessary to group alleles of the 
multi-allelic markers into a biallelic system. Analysis of the markers in a biallelic format has 
the added advantages that the notation D ijs p J( and ^ can be simplified to D, p, and q 
respectively, where p and q are defined to be the frequencies of the rarer alleles at both loci 
(such that without loss of generality p < q <, 0.5), andD is the estimated disequilibrium 
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between those alleles. 

Under a biallelic system, power is also much simpler to determine using 
equations as detailed by Hill (Hill, Heredity, 33: 229-39 (1974)). In addition, the sign off) 

A 

becomes informative, such that D > 0 when the rarer alleles at each of the two loci are 

A 

associated, and D < 0 when the rare allele at one locus is associated with the common allele at 
the other locus. 

Because the method of allele grouping clearly affected the power to detect 
disequilibrium (Zouros, ei al.. Genet. 85: 543-50 (1977); Weir, et al., Genet. 88: 633-42 
(1976)), a preliminary analysis was conducted to ensure that the grouping did not mask 
disequilibrium between subsets of alleles. In this analysis, 5 y - (O^j - E^/V E 0 was calculated 
for each haplotype, where H ;j is the expected number of haplotypes ij assuming equilibrium 
(E,j = 2n p,q r where n = number of individuals in the study), and 0' } is a basic estimate for the 
observed haplotype count determined as follows. All genotypes that could be unambiguously 
resolved were haplotype counted. Each double heterozygote (i^/j j 2 ) could be resolved into 
two possible haplotype sets, [iiJ,,iJJ or [i,J 2f y,]. Using the haplotype frequencies as 
estimated from the unambiguous haplotype count, the probability of each set was calculated 
and used as a "partial" count. In this way the ambiguous genotypes were also haplotype 
counted, and the total counts (ambiguous plus unambiguous) constituted the O^'s used in 8, r 
Once established, the magnitude and sign of the S^'s were used to determine which allelic 
combinations showed greatest deviation from the null hypothesis of no association. This 
information was used to group alleles at the multiallelic loci into biallelic systems to enable 
efficient use of the E.H. program. 

In order to compare the degree of disequilibrium between different pairwise 
combinations of loci, a frequency independent measure of disequilibrium D, the proportion of 
maximum possible disequilibrium in the given direction) was calculated, where D = D/|D mitl 
(Thompson, et al., Am. J. Hum Genet. 42: 1 13-24 (1988)). The relationship between p and q 
are such that p s q ^ 0.5, and it can therefore be written that -pq ^ D $ p(l-q) such that when 

A A 

D< 0, D mix = -pq and when D>0, D max = p(l-q). Output from the E.H. program included 
log-likelihoods for the maximum likelihood parameter values under H 0 and H,, and since 
-2 In (Lo/L,)~Xp where L 0 and L, are the likelihoods under H 0 and H,, p- values could then 
be determined for each test. 

A 

The asymptotic variance for D, under H 0 : D = 0 and H, were computed using 
the formula as defined by Hill (Heredity 33: 229-39 (1974)) for genotypic data. Using these. 
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the power for each pamvise comparison could be calculated. 

Common haplotypes containing all 8 loci were identified from the preliminary- 
analysis of 6,j described above, and backed up by the magnitude and sign of the disequilibna 
once the alleles at the multiallelic loci had been grouped. For these loci, the allele in the 
group which contributed most to the disequilibrium has been identified on the haplotype. To 
estimate the population haplotype frequencies, rates of carriage of at least one copy of the 
relevant alleles in the population were determined. These do not represent true haplotypes 
since phase is unknown. Monte Carlo simulation techniques were used to test for significant 
deviation from a simulated null distribution for these combined carriages under the 
assumption of no association. 



Example 3 Estimation of Linkage Disequilibrium in the 11 ,-1 G ene Cl uster 

A number of biallehc and multiallelic markers in and around the IL-1 genes 
have been identified. However, the extent of linkage disequilibrium between the markers, 
and the prevalence of multimarker haplotypes in the general population have not until now 
been identified. 

Figure 1 shows the relative positions of the 8 marker loci used in this study. 
DNA samples from 212 unrelated healthy volunteers were genotyped for each of these 
markers, and the resulting estimates of allele frequencies are shown in Table 5. 
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Table 5. Estimated frequencies nf marker alleles 



222/223 


freq. 


iiz5/gz6 


freq. 


-889 


free). 


+3953 


freq. 


1(126 bp} 


0.005 


1(79 bp) 


0.003 


l(NcoI) 


0.714 


1(2 
Taql) 


0.812 


2(128 bp) 


0.018 


2 (82 bp) 


0.005 


2 


0.286 


2 


0.188 


3 (130 bp) 


0.378 


3 (88 bp) 


0.676 










4(132 bp) 


0.299 


4 (91 bp) 


0.316 










5 (134 bp) 


0.016 














6 (136 bp) 


0.208 














7 (138 bp) 


0.055 














8 (140 bp) 


0.003 














9 (142 bp) 


0.010 














10(144 bp) 


0.008 














♦total 


384 




392 




398 




398 


-511 


freq. 


gaat.p33330 


freq. 


Y31 


freq. 


VNTR 


freq. 


1 


0.618 


1 (189 bp) 


0.658 


1 (148 bp) 


0.092 


1 


0.744 


2 (Bsu361) 


0.382 


2 (193 bp) 


0.002 


2 (158 bp) 


0.008 


2 


0.256 






3 (197 bp) 


0.255 


3(160 bp) 


0.454 










4(201 bp) 


0.084 


4(162 bp) 


0.062 














5 (164 bp) 


0.003 














6 (166 bp) 


0.122 














7(168 bp) 


0.035 














8(170 bp) 


0.030 














9(172 bp) 


0.095 














10(174 bp) 


0.087 














11 (176bp) 


0.003 














12 (178 bp) 


0.011 








398 




404 




370 




398 



*number of chromosomes analyzed 

Note - Allele names (and sizes) are given in boldface. 



To determine the linkage disequilibria between pairwise combinations ofloci, 
the computer program of Xie and Ott was used. This program was found to be most efficient 
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when used with biallelic systems, therefore alleles at the multiallelic loci were grouped in the 
most appropriate way for each pairwise comparison, such that disequilibrium between subsets 
of alleles was not masked. 

In Table 6, the disequilibria between pairs of loci are expressed as D, the ratio 

A 

of D to its maximum value D mi)t and are shown together with the approximate physical 
distances between the loci in kilobase pairs. 

^ A 

Table 6. Disequilibrium (D =D/|D mi J ) and physical distances between markers 





222/223 


gz5/gz6 


-889 


+3953 


222/223 


- 


+0.872 


+0.829 


+0.710 


gz5/gz6 


2.5 


- 


-0.889 


-0.695 


-889 


7 


4,5 


- 


+0.804 


+3953 


55 


55 


50 




-511 


60 


60 


55 


4.5 


gaat.p33330 


260 


260 


255 


205 


Y31 


310 


310 


305 


255 


VNTR 


380 


380 


375 


325 




-511 


gaat.p33330 


Y31 


VNTR 


222/223 


+0.535 


+0433 


+0.364 


-0.499 


gz5/gz6 


+0.540 


+0.517 


-0.503 


+0.286 


-889 


-0.264 


+0.337 


+0.318 


-0.207 


+3954 


-0.617 


+0.409 


-0.475 


0.439 


-5 1 1 




+0.691 


-0.456 


+0.448 


gaat.p33330 


200 




+0.639 


+0.442 


Y31 


250 


50 




-0.765 


VNTR 


320 


120 


70 





Note - disequilibrium values are shown at the top right, approximate physical distances in Kb are shown at the 
bottom left. Intergenic distances are given to the nearest 5 Kb. 

Table 7 shows the power to detect 50% D mai for each locus combination, and 
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the p values for each corresponding D. 

Table 7. Power to detect 50% D mtx and p values of -2Ln (L Q L,) 









-889 

OO 7 


-1-3953 


LULLS 










gZCVgZO 


<i Yin 10 




R7M 




-say 


<;1 vlO' 10 


<1y1 (V s 

^JA 1 U 




.1 ()()(+) 




1 y1 O" 7 


*-Qy 1 0° 


<1 xl O" 10 




-S1 1 


-9xl0" 10 


-4x10" 10 


*-9.4xl0" 2 


*-2.6xl0* 2 


gdal.pj J J U 


-9y 1 0" 8 


-2xl0' 9 


*-l 7xl0' 2 


-5X10" 4 


Y31 


-IxlO" 4 


-4xl0' 4 


-6x1 0* 4 


-IxlO' 7 


VNTR 

VIM II\ 


-IxlO" 3 


-IxlO" 3 


*-3xl0" 1 


*-1.2xl0~' 






eaat d33330 


Y31 


VNTR 


22/223 


~100f+1 


~100(+) 


~100(+) 






~100(+) 


-100M 


98M 


~100(+) 


-889 


96(-) 


89(+) 


~100(+) 


78(-) 


+3954 


79(-) 


97(+) 


~100(+) 


52(-) 


-511 




~100(+) 


~100(-) 


~100(+) 


gaat.p33330 


<lxl0-'° 




49(+) 


~100(+) 


Y31 


~2xl 0 -4 


*~7xl0" 3 




89(-) 


VNTR 


~8xl 0' 6 


-IxlO" 9 


2xl0" 7 





Note - Power is shown at the top right with the sign of disequilibrium in brackets; pointwise 
p-values are shown (uncorrected) at the bottom left. For an overall significance level of p = 0.05, 
pointwise significance level is 0.0018 for 28 comparisons. 
* Not significant at p = 0.0018 threshhold 



Significant linkage disequilibrium (p cori < 0.05) was detected between most 
combinations of loci, with only a few exceptions. These include the comparisons between the 
VNTR and the more distant biallelic markers, +3954, and -889, in which the disequilibrium is 
in the negative direction and consequently the power is reduced (Table 7). The correlation 
between disequilibrium D and physical distance was r= -0.752 (p < 0.0001, one tailed) 
(Figure,2). 

In order to compare different grouping methods for the muluallelic markers, D 
was calculated for all the comparisons involving 222/223 using two additional grouping 



-55- 




WO 01/00880 PCT/USOO/18318 

strategies. The first of these was a "common allele versus the rest" approach, and the second 
was a grouping based on allele size, using the bimodal distribution of allele frequency versus 
size which was observed for all the multialleHc markers examined. The results of this analysis 
are shown in Table 8, where D values for the three grouping methods are compared. 

Table 8.D values for three methods of grouping alleles at the multiallelic marker loci 







common vs. rest 


allele size 


gz5/gz6 


0.87 


0.79 


0.77 


-899 


0.83 


0.81 


0.98 


+3954 


0.71 


♦0.74 


0.77 


-511 


0.54 


♦0.15 


0.61 


gaat.p33330 


0.43 


*0.03 


0.53 


Y31 


0.36 


♦0.12 


0.16 


VNTR 


0.5 


0.48 


*0.04 



Note- Values re given for the disequilibrium between 222/223 and the other markers listed. 
* indicates not significant at p = 0.05 level, even before correction for multiple testing. 



It can be seen that the disequilibrium is not detected in several instances using 
these other grouping strategies, notably 222/223 with -511 and gaat.p33330 in the common 
versus rest approach, 222/223 with Y31 in both the common versus rest and allele size 
approaches, and 222/223 with VNTR in the allele size approach. 

Examination of which alleles of the multiallelic loci were contributing greatest 
to the disequilibrium, from the determination of 5^ revealed the existence of 2 haplotypes 
containing alleles of all 8 loci. These were confirmed by examination of the haplotype 
frequencies and disequilibrium values obtained after the grouping. The first haplotype: alleles 
441 12332 (expressed in chromosome order, see Fig. 1) is the most common (carriage of 
34/198), and is present 7 times more frequently than expected (expected = 4.5/198) (p < 
0.000001). The second haplotype: alleles 33221461 (carriage of 2/206) was present 4 times 
more frequently than expected (expected 0.5/206), but this was not statistically significant (p 
~ 0. 106). However, examination of a larger sample size might assist in increasing the 
statistical significance of this finding. 

The data presented indicate a significant degree of linkage disequilibrium 
across an approximately 400 Kb stretch of chromosome 2ql3. The disequilibrium was strong 
both for the three markers within the IL-la gene, as might be expected, but was also strong 
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between some of the more distantly separated markers (-899/+3954;D = +0.804, physical 
distance = 50 Kb) (Table 6). However, D was considerably diminished between the extreme 
ends of the cluster. Within the IL-lp gene, a moderate value of D (+3954/-51 1; D = -0.617) 
was obtained, although this was not significant when corrected for multiple comparisons, 
probably reflecting the reduction in power when disequilibrium is in the negative direction 
(Thompson, et al, Am J Hum. Genet. 42: J 13-24 (1988)). 

Overall, there is a good correlation between physical distance and linkage 
disequilibrium (Figure 2); r = -0.752. The reliability of r itself depends partly on the 
reliability of the estimates of both physical distance and D. Over the short distances, the 
physical distances are accurate since they are determined from known DNA sequence, 
whereas the longer range estimates are less precise. The power can be taken tentatively as one 
indicator of the reliability of D, since if power is low this indicates that the sample size was 
too small, and with low sample sizes the estimates for D may be unreliable. 

The success of the elaborate grouping strategy is indicated by Table 8, which 
shows several instances where disequilibrium between particular loci is apparently low or not 
detected when other commonly used grouping methods are employed. The disadvantages of 
the grouping strategy used here are that it is rather laborious since the information used for 
the grouping was based on an approximate estimate of the "observed" haplotype frequencies 
(see Example 2). For the more polymorphic markers the higher heterozygosity meant that the 
estimate of 5 ijt was less precise since there was a higher proportion of ambiguous haplotypes. 
Notwithstanding this drawback, care was taken to take into account both the sign and 
magnitude of and the frequencies of the alleles concerned. 

The method could be simplified, in a sufficiently large study, by just 
considering the unambiguous haplotypes when determining the grouping. The determination 
of 8,j uses the maximum amount of prior knowledge for the grouping of the multiallelic 
markers, and this may be the reason why disequilibrium between almost all pairwise 
combinations of markers was detected. 

The two haplotypes containing all 8 markers, as well as other shorter 
haplotypes, are of particular interest since it is likely that particular combinations of alleles of 
the IL- 1 genes may act in concert to determine an overall inflammatory phenotype. An 
understanding of which markers are in strong linkage disequilibrium not only allows for more 
rational design of genetic studies but also may provide clues to disease mechanism. 
Therefore, in addition to the alleles identified herein, the IL-1 (441 12332) haplotype may 
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contain the following alleles: 



allele 2 of the 1731 marker of the ILIRN gene(A at position 173 1); 

allele 2 of the 1812 marker of the IL1RN gene (A at position 1812); 

allele 2 of the 1868 marker of the ILIRN gene(G at position 1868); 

allele 2 of the 1887 marker of the iLIRN gene(C at position 1887); 

allele 2 of the 8006 marker of the ILIRN gene (contains a Hpall or Mspl site) 

allele 2 of the 8061 marker of the ILIRN gene (lacks a Mwol site) 

allele 2 of the 9589 marker of the ILIRN gene (contains an Sspl site) 

Furthermore, the following PCR primers may be used to amplify these alleles: 

TTACGCAGATAAGAACCAGTTTGG (SEQ ID NO. 24) 
TTTCCTGGACGCTTGCTCACCA (SEQ ID NO. 25) 
(used for 1731, 1812, 1868, and 1887) 

TTCTATCTGAGGAACAACCAACTAGTAGC (SEQ ID NO. 26) 
CACCAGACTTGACACAGGACAGGCACATC (SEQ ID NO. 27) 
(used for 8006) 

CGACCCTCTGGGAGAAAATCCAGCAAG (SEQ ID NO. 28) 
(used with SEQ ID NO. 20 for 8006) 

ACACAGGAAGGTGCCAAGCA (SEQ ID NO. 29) 
TGCAGACAGACGGGCAAAGT (SEQ ID NO. 30) 
(used for 8006 and 9589) 

TTGTGGGGACCAGGGGAGAT (SEQ ID NO. 31), and 
AGCCTGGCACTCTGCTGAAT (SEQ ID NO. 32) 
(used for 9589). 



Example 4 The IL-1 (441123-32) Haplotype is Associated with Diabetic Nephropathy 

The presence of the two haplotypes described herein was investigated in 
healthy and diseased populations to determine if the haplotypes were associated with 
inflammatory disease. 8 1 non-insulin dependant diabetes mellitus (NIDDM) patients with 
nephropathy were compared with 198 ethnically matched healthy subjects in example 3 and 
147 NIDDM patients without nephropathy. Genotyping was carried out as in example 1 . 
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The IL-1 (441 12332) haplotype was carried by 24 of 79 of the NIDDM 
nephropathy patients and 25 of 141 NIDDM without nephropathy patients. However, the 
second haplotype (3322146 1) was not found in the nephropathy patients (0/8 1). The IL-1 
(441 12332) haplotype was significantly over represented in the patient group compared with 
the healthy control group (24/79 vs. 34/198; p = 0.015) and the NIDDM without nephropathy 
group (24/79 vs. 25/141; p = 0.03). 

Example 5 An IL-1 Haplotype is Associated with Inflammatory Disease 

This is a prophetic example. Other diseases are examined as per Example 4. 

The IL-1 (441 12332) haplotype is found to be associated with coronary artery disease, 

osteoporosis, nephropathy in diabetes mellitus, alopecia areata, Graves disease, systemic 

lupus erythematosus, lichen sclerosis and ulcerative colitis. 

Likewise, the IL-1 (33221461) haplotype is associated with periodontal 

disease, juvenile chronic arthritis, psoriasis, insulin dependant diabetes and diabetic 

retinopathy. 

Example 6 Novel Markers are Linked to an TL-1 Haplotype 

This is a prophetic example. Additional markers are identified by sequence 
and restriction enzyme analysis of the 2ql3-14 region. These new markers are identified as 
belonging to an IL-1 haplotype in the manner described in Examples 2 and 3. 

Example 7 The TL-1 (441 12332) Haplotype Is Used to Predict Disease Susceptibility 

This is a prophetic example. A patient with a family history of ulcerative 
colitis is genotyped for the presence of the IL-1 (441 12332) haplotype. Genotyping is 
performed as in Example I and the patient is determined to carry one or more alleles of the 
haplotype. The patient is thus treated with IL-1 antagonists to prevent disease. 

A second patient with a family history of coronary artery disease is genotyped 
at the IL-1 gene cluster. The patient is found to carry one or more alleles; of the IL-1 
(441 12332) haplotype and be homozygous for the VNTR allele 2. Thus, the patient is 5.4 
times as likely to develop coronary artery disease as the general population and is treated 
vigorously to prevent disease. 

Example 8 Additional Haplotypes are Statistically Significant 
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This is a prophetic example. An additional 400 chromosomes are typed as per 
Example 1 and linkage disequilibrium assessed as per Example 2. The IL-1 (33221461) 
haplotype is found to be present about 4 times more frequently than expected (p - 0.05). 

In a similar manner, the following markers are determined to be present in the 

IL- 1 (441 12332) haplotype (p « 0.05). 

allele 2 of the 1731 marker of the IL1RN gene 
allele 2 of the 1812 marker of the It 1RN gene 
allele 2 of the 1 868 marker of the IL1RN gene 
allele 2 of the 1887 marker of the IL1RN gene 
allele 2 of the 8006 marker of the IL1RN gene 
allele 2 of the 8061 marker of the IL1RN gene 
allele 2 of the 9589 marker of the IL1RN gene 
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WHAT IS CLAIMED IS: 

1 . A method for determining whether a subject has or is predisposed to 
developing a disease or condition that is associated with an IL-1 inflammatory haplotype, 
comprising detecting at least one allele of the haplotype, wherein the presence of the allele 
indicates that the subject is predisposed to the development or has the disease or condition. 

2. A method of claim 1, wherein the disease of condition is selected from the 
group consisting of an inflammatory disease, a degenerative disease an immunological 
disorder, an infectious disease, a trauma induced disease, and a cancer. 

3. A method of claim 1, wherein said detecting step is selected from the group 

consisting of: 



a) 


allele specific oligonucleotide hybridization; 


b) 


size analysis; 


c) 


sequencing; 


d) 


hybridization; 


e) 


5' nuclease digestion; 


0 


single-stranded conformation polymorphism; 


g) 


allele specific hybridization; 


h) 


primer specific extension; and 


J) 


oligonucleotide ligation assay. 



4. A method of claim 1 , wherein prior to or in conjunction with detection, the 
nucleic acid sample is subject to an amplification step. 

5. A method of claim 4, wherein said amplification step employs a primer 
selected from the group consisting of any of SEQ ID Nos.8-32. 

6. A method of claim 3, wherein said size analysis is preceded by a restriction 
enzyme digestion. 
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7. A kit comprising a primer selected from the group consisting of any of 
SEQ ID NOs. 8-32. 

8. A method for selecting an appropriate therapeutic for an individual that has 
or is predisposed to developing a disease or disorder that is associated with an IL-1 
polymorphism, comprising the steps of: detecting whether the subject contains the 
polymorphism and selecting a therapeutic that compensates for a causative functional 
mutation that is in linkage disequilibrium with the IL-1 polymorphism. 

9. A method of claim 8, wherein said detecting is performed using a technique 
selected from the group consisting of: 



10. A method of claim 8, wherein prior to or in conjunction with detecting, 
the nucleic acid sample is subjected to an amplification step. 

1 1 . A method of claim 10, wherein said amplification step employs a primer 
selected from the group consisting of SEQ ID Nos. 8-32. 

12. A method of claim 9, wherein said size analysis is preceded by a 
restriction enzyme digestion. 

13. A method of claim 9, wherein the disease or condition is selected from the 
group consisting of: 



a) 
b) 
c) 
d) 
e) 
f) 
g) 
h) 
j) 



allele specific oligonucleotide hybridization; 
size analysis; 



sequencing; 



hybridization; 

5' nuclease digestion; 

single-stranded conformation polymorphism; 
allele specific hybridization; 
primer specific extension; and 
oligonucleotide ligation assay. 



14. A method of claim 9, wherein the therapeutic is a modulator of an IL-1 
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activity. 



15. A method of claim 14, wherein the IL-l activity is IL-la. 



16. A method of claim 14, wherein the IL-l activity is IL-l p. 



17. A method of claim 14, wherein the IL-l activity is IL-1RN. 



18. A method of claim 14, wherein the modulator of an IL-l activity is a 
protein, peptide, peptidomimetic, small molecule, nucleic acid or a nutraceutical. 



21. A method for determining the effectiveness of treating a subject that has 
or is predisposed to developing a disease or condition that is associated with an IL-l 
polymorphism with a particular dose of a particular therapeutic, comprising the steps of: 

a) detecting the level, amount or activity of an IL-l protein; or an IL-l 
rnRNA or DNA in a sample obtained from a subject; 

b) administering the particular dose of the particular therapeutic to the 
subject; detecting the level, amount or activity of an IL-l protein; or an IL-l rnRNA or DNA 
in a sample obtained from a subject; and 

c) comparing the relative level, amount or activity obtained in step a) 
with the level, amount or activity obtained in step b). 

22. A method of claim 21, wherein the therapeutic is a modulator of an IL-l 

activity. 



19. A method of claim 14, wherein the modulator is an agonist. 



20. A method of claim 14, wherein the modulator is an antagonist. 



23. A method of claim 22, wherein the IL-l activity is IL-la. 



24. A method of claim 22, wherein the IL-l activity is IL-l p. 
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25. A method of claim 22, wherein the IL-1 activity is IL-1RN 

26. A method of claim 21, wherein the therapeutic is a protein, peptide, 
peptidomimetic, small molecule or a nucleic acid. 

27. A method of claim 22, wherein the modulator is an agonist. 

28. A method of claim 22, wherein the modulator is an antagonist. 

29. A method for treating or preventing the development of a disease or 
condition that is associated with an IL-1 polymorphism in a subject comprising the steps of 
detecting the presence of at least one IL-l polymorphism comprising and IL-1 inflammatory 
haplotype and administering to the subject a therapeutic that compensates for a causative 
mutation that is in linkage disequilibrium with the at least one IL-1 polymorphism. 

30. A method of claim 29, wherein the detecting step is selected from the 
group consisting of: 



a) 


allele specific oligonucleotide hybridization; 


b) 


size analysis; 


c) 


sequencing; 


d) 


hybridization; 


e) 


5' nuclease digestion; 


f) 


single-stranded conformation polymorphism; 


g) 


allele specific hybridization; 


h) 


primer specific extension; and 


j) 


oligonucleotide ligation assay. 



3 1 . A method of claim 29, wherein prior to or in conjunction with detecting, 
the nucleic acid sample is subjected to an amplification step. 

32. A method of claim 29, wherein said amplification step employs a primer 
selected from the group consisting of any of SEQ ID Nos. 8-32. 
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33. A method of claim 30, wherein said size analysis is preceded by a 
restriction enzyme digestion. 

34. A method of claim 30, wherein the therapeutic is selected from the group 
consisting of: a modulator of an IL-1 activity. 



38. A method of claim 34, wherein the therapeutic is a protein, peptide, 
peptidomimetic, small molecule or a nucleic acid. 



4 1 . A method for screening for a therapeutic for treating or preventing a 
disease or condition that is associated with an IL-1 polymorphism comprising a 
proinflammatory haplotype comprising the steps of: 

a) combining an IL-1 polypeptide or bioactive fragment thereof, an IL-1 
binding partner and a test compound under conditions wherein, but for the test compound, the 
IL-1 protein and IL-1 binding partner are able to interact; and 

b) detecting the extent to which, in the presence of the test compound, an IL-1 
protein/IL-1 binding partner complex is formed, wherein an increase in the amount of 
complex formed by an agonist in the presence of the compound relative to in the absence of 
the compound or a decrease in the amount of complex formed by an antagonist in the 
presence of the compound relative to in the absence of the compound indicates that the 
compound is an effective therapeutic for treating or preventing the disease or condition. 



35. A method of claim 34, wherein the IL-1 activity is IL-la. 



36. A method of claim 34, wherein the IL-1 activity is IL-1 p. 



37. A method of claim 34, wherein the IL-1 activity is IL-lRa. 



39. A method of claim 34, wherein the modulator is an agonist. 



40. A method of claim 34, wherein the modulator is an antagonist. 
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42. A method of claim 41, wherein the agonist or antagonist is selected from 
the group consisting of: a protein, peptide, peptidomimetic, small molecule or nucleic acid. 

43. A method of claim 42, wherein the nucleic acid is selected from the group 
consisting of: an antisense, ribozyme and triplex nucleic acid. 

44. A method of claim 41, which additionally comprises the step of preparing 
a pharmaceutical composition from the compound. 

45. A method of claim 41, wherein the IL-1 polypeptide is IL-la. 

46. A method of claim 41, wherein the IL-1 polypeptide is IL-1 p. 

47. A method of claim 41, wherein the IL-1 polypeptide is IL-lRa. 

48. A method for identifying a therapeutic for treating or preventing a disease 
or condition that is associated with an IL-l polymorphism that comprises an inflammatory 
haplotype, comprising the steps of: 

a) contacting an appropriate amount of a candidate compound with a cell or 
cellular extract, which expresses an IL-1 gene; and 

b) determining the resulting protein bioactivity, wherein a decrease of an 
agonist bioactivity or a decrease in an antagonist bioactivity in the presence of the compound 
as compared to the bioactivity in the absence of the compound indicates that the candidate is 
an effective therapeutic. 

49. A method of claim 48, wherein the modulator is an antagonist of an IL-la 
or an IL-1 (3, bioactivity. 

50. A method of claim 48, wherein the modulator is an agonist of an IL-1RN 

bioactivity. 
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51. A method of claim 48, wherein in step (b), the protein bioactivity is 
determined by determining the expression level of an IL-1 gene, 

52. A method of claim 51, wherein the expression level is determined by 
detecting the amount of mRNA transcribed from an IL-1 gene. 

53. A method of claim 5 1 , wherein the expression level is determined by 
detecting the amount of the IL-1 product produced. 

54. A method of claim 51, wherein the expression level is determined using 
an anti- IL-1 antibody in an immunodetection assay. 

55. A method of claim 51, which additionally comprises the step of preparing 
a pharmaceutical composition from the compound. 

56. A method of claim 5 1 , wherein said cell is contained in an animal. 

57. A method of claim 56, wherein the animal is transgenic. 
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DNA Sequence of the human IL-1A gene. (GenBank Accession No. X03833) 

1437 AAGCTTCTAC CCTAGTCTGG TGCTACACTT ACATTGCTTA CATCCAAGTG TGGTTATTTC 

1377 TGTGGCTCCT GTTATAACTA TTATAGCACC AGGTCTATGA CCAGGAGAAT TAGACTGGCA 

1317 TTAAATCAGA ATAAGAGATT TTGCACCTGC AATAGACCTT ATGACACCTA ACCAACCCCA 

1257 TTATTTACAA TTAAACAGGA ACAGAGGGAA TACTTTATCC AACTCACACA AGCTGTTTTC 

1197 CTCCCAGATC CATGCTTTTT TGCGTTTATT ATTTTTTAGA GATGGGGGCT TCACTATGTT 

1137 GCCCACACTG GACTAAAACT CTGGGCCTCA AGTGATTGTC CTGCCTCAGC CTCCTGAATA 

1077 GCTGGGACTA CAGGGGCATG CCATCACACC TAGTTCATTT CCTCTATTTA AAATATACAT 

1017 GGCTTAAACT CCAACTGGGA ACCCAAAACA TTCATTTGCT AAGAGTCTGG TGTTCTACCA 

-957 CCTGAACTAG GCTGGCCACA GGAATTATAA AAGCTGAGAA ATTCTTTAAT AATAGTAACC 

-897 AGGCAACATC ATTGAAGGCT CATATGTAAA AATCCATGCC TTCCTTTCTC CCAATCTCCA 

-837 TTCCCAAACT TAGCCACTGG TTCTGGCTGA GGCCTTACGC ATACCTCCCG GGGCTTGCAC 

-777 ACACCTTCTT CTACAGAAGA CACACCTTGG GCATATCCTA CAGAAGACCA GGCTTCTCTC 

-717 TGGTCCTTGG TAGAGGGCTA CTTTACTGTA ICKIOOCCld GGTGGAGAGT TCTCTCCTGA 

-657 AGCTCCATCC CCTCTATAGG AAATGTGTTG ACAATATTCA GAAGAGTAAG AGGATCAAGA 

-597 CTTCTTTGTG CTCAAATACC ACTGTTCTCT TCTCTACCCT GCCCTAACCA GGAGCTTGTC 

-537 ACCCCAAACT CTGAGGTGAT TTATGCCTTA ATCAAGCAAA CTTCCCTCTT CAGAAAAGAT 

-477 GGCTCATTTT CCCTCAAAAG TTGCCAGGAO CTGCCAAGTA TTCTGCCAAT TCACCCTGGA 

-417 GCACAATCAA CAAATTCAGC CAGAACACAA CTACAGCTAC TATTAGAACT ATTATTATTA 

-357 ATAAATTCCT CTCCAAATCT AGCCCCTTGA CTTCGGATTT CACGATTTCT CCCTTCCTCC 

-297 TAGAAACTTG ATAAGTTTCC CGCGCTTCCC TTTTTCTAAG ACTACATGTT TGTCATCTTA 

-237 TAAAGCAAAG GGGTGAATAA ATGAACCAAA TCAATAACTT CTGGAATATC TGCAAACAAC 

-177 AATAATATCA GCTATGCCAT CTTTCACTAT TTTAGCCAGT ATCGAGTTGA ATGAACATAG 

-117 AAAAATACAA AACTGAATTC TTCCCTGTAA ATTCCCCGTT TTGACGACGC ACTTGTAGCC 

-57 ACGTAGCCAC GCCTACTTAA GACAATTACA AAAGGCGAAG AAGACTGACT CAGGCTTAAG 

4 CTGCCAGCCA GAGAGGGAGT CATTTCATTG GCGTTTGAGT CAGCAAAGGT ATTGTCCTCA 

64 CATCTCTGGC TATTAAAGTA TTTTCTGTTG TTGTTTTTCT CTTTGGCTGT TTTCTCTCAC 

124 ATTGCCTTCT CTAAAGCTAC AGTCTCTCCT TTCTTTTCTT GTCCCTCCCT GGTTTGGTAT 

184 GTGACCTAGA ATTACAGTCA GATTTCAGAA AATGATTCTC TCATTTTGCT GATAAGGACT 

244 GATTCGTTTT ACTGAGGGAC GGCAGAACTA GTTTCCTATG AGGGCATGGG TGAATACAAC 

304 TGAGGCTTCT CATGGGAGGG AATCTCTACT ATCCAAAATT ATTAGGAGAA AATTGAAAAT 

364 TTCCAACTCT GTCTCTCTCT TACCTCTGTG TAAGGCAAAT ACCTTATTCT TGTGGTGTTT 

424 TTGTAACCTC TTCAAACTTT CATTGATTGA ATGCCTGTTC TGGCAATACA TTAGGTTGGG 

464 CACATAAGGA ATACCAACAT AAATAAAACA TTCTAAAAGA AGTTTACGAT CTAATAAAGG 

544 AGACAGGTAC ATAGCAAACT AATTCAAAGG AGCTAGAAGA TGGAGAAAAT GCTGAATGTG 

604 GACTAAGTCA TTCAACAAAG TTTTCAGGAA GCACAJLAGAG GAGGGGCTCC CCTCACAGAT 

664 ATCTGGATTA GAGGCTGGCT GAGCTGATGG TGGCTGGTGT TCTCTGTTGC AGAAGTCAAG 

724 ATGGCCAAAG TTCCAGACAT GTTTGAAGAC CTGAAGAACT GTTACAGGTA AGGAATAAGA 

784 TTTATCTCTT GTGATTTAAT GAGGGTTTCA AGGCTCACCA GAATCCAGCT AGGCATAACA 

844 GTGGCCAGCA TGGGGGCAGG CCGGCAGAGG TTGTAGAGA7 GTGTACTAGT CCTGAAGTCA 

904 GAGCAGGTTC AGAGAAGACC CAGAAAAACT AAGCATTCAG CATGTTAAAC TGAGATTACA 

964 TTGGCAGGGA GACCGCCATT TTAGAAARAT TATTTTTGAG GTCTGCTGAG CCCTACATGA 

1024 ATATCAGCAT CAACTTAGAC ACAGCCTCTG TTGAGATCAC ATGCCCTGAT ATAAGAATGG 

1084 GTTTTACTGG TCCATTCTCA GGAAAACTTG ATCTCATTCA GGAACAGGAA ATGGCTCCAC 

Fig. 3 
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1144 AGCAAGCTGG GCATGTGAAC TCACATATGC AGGCAAATCT CACTCAGATG TAGAAGAAAG 
1204 GTAAATGAAC ACAAAGATAA AATTACGGAA CATATTAAAC TAACATGATG TTTCCATTAT 
1264 CTGTAGTAAA TACTAACACA AACTAGGCTG TCAAAATTTT GCCTGGATAT TTTACTAAGT 
1324 ATAAATTATG AAATCTGTTT TAGTGAATAC ATGAAA0TAA TGTGTAACAT ATAAT CTATT 
1384 TGGTTAAAAT AAAAAGGAAG TGCTTCAAAA CCTTTCTTTT CTCTAAAGGA GCTTAACATT 
1444 CTTCCCTGAA CTTCAATTAA AGCTCTTCAA TTTGTTAGCC AAGTCCAATT TTTACAGATA 
1504 AAGCACAGGT AAAGCTCAAA GCCTGTCTTG ATGACTACTA ATTCCAGATT AGTAAGATAT 
1564 GAATTACTCT ACCTATGTGT ATGTGTAGAA GTCCTTAAAT TTCAAAGATG ACAGTAATGG 
1624 CCATGTGTAT GTGTGTGACC CACAACTATC ATGGTCATTA AAGTACATTG GCCAGAGACC 
1684 ACATGAAATA ACAACAATTA CATTCTCATC ATCTTATTTT GACAGTGAAA ATGAAGAAGA 
1744 CAGTTCCTCC ATTGATCATC TGTCTCTGAA TCAGGTAAGC AAATGACTGT AATTCTCATG 
1804 GGACTGCTAT TCTTACACAG TGGTTTCTTC ATCCAAAGAG AACAGCAATG ACTTGAATCT 
1864 TAAATACTTT TGTTTTACCC TCACTAGAGA TCCAGAGACC TGTCTTTCAT TATAAGTGAG 
1924 ACCAGCTGCC TCTCTAAACT AATAGTTGAT GTGCATTGGC TTCTCCCAGA ACAGAGCAGA 
1984 ACTATCCCAA ATCCCTGAGA ACTGGAGTCT CCTGGGGCAG GCTTCATCAG GATGTTAGTT 
2044 ATGCCATCCT GAGAAAGCCC CGCAGGCCGC TTCACCAGGT GTCTGTCTCC TAACGTGATG 
2104 TGTTGTGGTT GTCTTCTCTG ACACCAGCAT CAGAGGTTAG AGAAAGTCTC CAAACATGAA 
2164 GCTGAGAGAG AGGAAGCAAG CCAGCTGAAA GTGAOAAGTC TACAGCCACT CATCAATCTG 
2224 TGTTATTGTG TTTGGAGACC ACAAATAGAC ACTATAAGTA CTGCCTAGTA TGTCTTCAGT 
2284 ACTGGCTTTA AAAGCTGTCC CCAAAGGAGT ATTTCTAAAA TATTTTGAGC ATTGTTAAGC 
2344 AGATTTTTAA CCTCCTGAGA GGGAACTAAT TGGAAAGCTA CCACTCACTA CAATCATTGT 
2404 TAACCTATTT AGTTACAACA TCTCATTTTT GAGCATGCAA ATAAATGAAA AAGTCTTCCT 
2464 AAAAAAATCA TCTTTTTATC CTGGAAGGAG GAAGGAAGGT GAGACAAAAG GGAGAGAGGG 
2524 AGGGAAGCCT AATGAAACAC CAGTTACCTA AGACCAGAAT GGAGATCCTC CTCACTACCT 
2584 CTGTTGAATA CAGCACCTAC TGAAAGAACT TTCATTCCCT GACCATGAAC AGCCTCTCAG 
2644 CTTCTGTTTT CCTTCCTCAC AGAAATCCTT CTATCATGTA AGCTATGGCC CACTCCATGA 
2704 AGGCTGCATG GATCAATCTG TGTCTCTGAG TATCTCTGAA ACCTCTAAAA CATCCAAGCT 
2764 TACCTTCAAG GAGAGCATGG TGGTAGTAGC AACCAACGGG AAGGTTCTGA AGAAGAGACG 
2824 GTTGAGTTTA AGCCAATCCA TCACTGATGA TGACCTGGAG GCCATCGCCA ATGACTCAGA 
2884 GGAAGGTAAG GGGTCAAGCA CAATAATATC TTTCTTTTAC AGTTTTAAGC AAGTAGGGAC 
2944 AGTAGAATTT AGGGGAAAAT TAAACGTGGA GTCAGAATAA CAAGAAGACA ACCAAGCATT 
3004 AGTCTGGTAA CTATACAGAG GAAAATTAAT TTTTATCCTT CTCCAGGAGG GAGAAATGAG 
3064 CAGTGGCCTG AATCGAGAAT ACTTGCTCAC AGCCATTATT TCTTAGCCAT ATTGTAAAGG 
3124 TCGTGTGACT TTTAGCCTTT CAGGAGAAAG CAGTAATAAG ACCACTTACG AGCTATGTTC 
3184 CTCTCATACT AACTATGCCT CCTTGGTCAT GTTACATAAT CTTTTCGTGA TTCAGTTTCC 
3244 TCTACTGTAA AATGGAGATA ATCAGAATCC CCCACTCATT GGATTGTTGT AAAGATTAAG 
3304 AGTCTCAGGC TTTACAGACT GAGCTAGCTG GGCCCTCCTG ACTGTTATAA AGATTAAATG 
3364 AGTCAACATC CCCTAACTTC TGGACTAGAA TAATGTCTGG TACAAAGTAA GCACCCAATA 
3424 AATGTTAGCT ATTACTATCA TTATTATTAT TATTTTATTT TTTTTTTTTG AGATGGAGTC 
3484 TGGCTCTGTC ACCCAGGCTG GAGTGCAGTG GCACAATCTC GGCTCACTGC AAGCTCTGCC 
3544 TCCTGGGTTC ATGCCATTCT CCTGCCTCAG CCTCCCGAGT AAGCTGGGAA TACAGGCACC 
3604 CGCCACTGTT CCCGGCTAAT TTTTTGTATT TTTAGTAGAG ACGGAGTTTC ACCGTGGTCT 
3664 CCATCTCCTC GTGATCCACC CACCTTGGCC TCCCAAAGTG CCGGGATTAC AGGCGTOAGC 
3724 CACCGCGCCC GGCCTATTAT TATTATTATT ACTACTACTA CTACCTATAT GAATACTACC 
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3784 AGCAATACTA ATTTATTAAT GACTGGATTA TGTCTAAACC TCACAAGAAT CCTACCTTCT 
3844 CATTTTACAT AAAA6GAAAC TAAGCTCAT? GAGATAGGTA AACTGCCCAA TOGCATACAT 
3904 CTGTAAGTGG GAGAGCCTCA AATCTAATTC AGTTCTACCT GAGTAAAAAA ATCATGGTTT 
3964 CTCCTCCATC CCTTTACTGT ACAAGCCTCC ACATGAACTA TAAACCCAAT ATTCCTGTTT 
4024 TTAAGATAAT ACCTAAGCAA TAACGCATGT TCACCTAGAA GGTTTTAAAA TGTAACAAAA 
4084 TATAAGAAAA TAAAAATCAC TCATATCGTC AGTGAGAGTT TACTACTGCC AGCACTATGG 
4144 TATGTTTCCT TAAAATCTTT GCTATACACA TACCTACATG TGAACAAATA TGTCTAACAT 
4204 CAAGACCACA CTATTTACAA CTTTATATCC AGCTTTTCTT ACTTAGCAAT GTATTGAGGA 
4264 CATTTTAGAG TGCCCGTTTT TCACCATTAT AAGCAATGCA ACAATGAACA TCTGTATAAA 
4324 TAAATATTCA TTTCTCTCAC CCTTTATTTC CTTAGAATAT ATTCCTAGAA GTAGAATTTC 
4384 CCAGAGCCAT GAGGATTTGT GACGCTATTG ATATGTGCCA CTTTGCACTC TCTGTGACAT 
4444 ATATAATTAT TTTTAATGCA TTCATTTTTT TCTCAGAGTG CATTCGTTTG AAAACATAGA 
4504 CGGGAAATAC TGGTAGTCTT CCTTGTCAGT TAGAAACACC CAAACAATGA AAAATGAAAA 
4564 AGTTGCACAA ATAGTCTCTA AAAACAATGA AACTATTGCC TGAGGAATTG AAGTTTAAAA 
4624 AGAAGCACAT AAGCAACAAC AAGGATAATC CTAGAAAACC AGTTCTGCTG ACTGGGTGAT 
4684 TTCACTTCTC TTTGCTTCCT CATCTGGATT GGAATATTCC TAATACCCCC TCCAGAACTA 
4744 TTTTCCCTGT TTGTACTAGA CTGTGTATAT CATCTGTGTT TGTACATAGA CATTAATCTG 
4804 CACTTGTGAT CATGGTTTTA GAAATCATCA AGCCTAGGTC ATCACCTTTT AGCTTCCTGA 
4864 GCAATGTGAA ATACAACTTT ATGAGGA7CA TCAAATACGA ATTCATCCTG AATGACGCCC 
4924 TCAATCAAAG TATAATTCGA GCCAA7GATC AGTACCTCAC GGCTGCTGCA TTACATAATC 
4984 TGGATGAAGC AGGTACATTA AAATGGCACC AGACATTTCT GTCATCCTCC CCTCCTTTCA 
5044 TTTACTTATT TATTTATTTC AATCTTTCTG CTTGCAAAAA ACATACCTCT TCAGAGTTCT 
5104 GGGTTGCACA ATTCTTCCAG AATAGCTTGA AGCACAGCAC CCCCATAAAA ATCCCAAGCC 
5164 AGGGCAGAAG GTTCAACTAA ATCTGGAAGT TCCACAAGAG AGAAGTTTCC TATCTTTGAG 
5224 AGTAAAGGGT TGTGCACAAA GCTAGCTGAT GTACTACCTC TTTGGTTCTT TCAGACATTC 
5284 TTACCCTCAA TTTTAAAACT GAGGAAACTG TCAGACATAT TAAATGATTT ACTCAGATTT 
5344 ACCCAGAAGC CAATGAAGAA CAATCACTCT CCTTTAAAAA GTCTGTTGAT CAAACTCACA 
5404 AGTAACACCA AACCAGGAAG ATCTTTATTA TCTCTGATAA CATATTTGTG AGGCAAAACC 
5464 TCCAATAAGC TACAAATATG GCTTAAAGGA TGAAGTTTAG TGTCCAAAAA CTTTTATCAC 
5524 ACACATCCAA TTTTCATGGC GGACATGTTT TAGTTTCAAC AGTATACATA TTTTCAAAGG 
5584 TCCAGAGAGG CAATTTTGCA ATAAACAAGC AAGACTTTTT CTGATTGGAT GCACTTCAGC 
5644 TAACATGCTT TCAACTCTAC ATTTACAAAT TATTTTGTGT TCTATTTTTC TACTTAATAT 
5704 TATTTCTGCA ATTTTCCCAA TATTGACATC GTGTATGTAT TTGCCATTTT TAATATCACT 
5764 AGACAATTCA ATCAGGTTGC TACGTTGGTC CCTTGGGTTT ACTCTAAATA GCTTGATTGC 
5824 AAATATCTTT GTATATATTA TTGTTTTTTC TCCTATCTTG TAATTTCTTT GAGCACATCC 
5884 CAAAGAGGAA TGCCTAGATC AATGGGCACA AATAATTTGA CAGCTCTTAT TAAACATTAT 
5944 TCTGTAAGTA AAAACTGAAC TACTTTTCAG TATCACTAGC AACATATGAG TGTATCAGCT 
6004 TCCTAAACCC CTCCATGTTA GGTCATTATG AACTTATGAT CTAACAAATT ACAGGGTCTT 
6064 ATCCCACTAA TGAAATTATA AGAGATTCAA CACTTATTCA GCCCCGAAGG ATTCATTCAA 
6124 CGTAGAAAAT TCTAAGAACA TTAACCAAGT ATTTACCTGC CTAGTGAGTG TGGAAGACAT 
6184 TGTGAAGGAC ACAAAGATGT ATAGAATTCC ATTCCTGACT TCCAGGTATT TACACCATAG 
6244 GTGGGGACCT AACTACACAC ACACACACAC ACACACACAC ACACACACAC ACCATGCACA 
6304 CACAATCTAC ATCAACACTT GATTTTATAC AAATACAATG AATTTACTTT CTTTTTGGTT 
6364 CTTCTCTTCA CCAGTGAAAT TTGACATGGG TGCTTATAAG TCATCAAAGG ATGATGCTAA 
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6424 AATTACCGTG ATTCTAAGAA TCTCAAAAAC TCAATTGTAT GTGACTGCCC AAGATGAAGA 
6484 CCAACCAGTG CTGCTGAAGG TCAGTTGTCC TTTGTCTCCA ACTTACCTTC ATTTACATCT 
6544 CATATGTTTG TAAATAAGCC CAATAGGCAG ACACCTCTAA CAAGGTGACA CTGTCCTCTT 
6604 TCCTTCCTAC CACAGCCCCC ACCTACCCAC CCCACTCCCA TTGATTCCAG AGGCGTGCCT 
6664 AGGCAGGATC TATGAGAAAA TATAACAGAG AGTAAGAGGA AAATTACCTT CTTTCTTTTT 
6724 CCTTTCCCTG CCTGACCTTA TTCACCTCCC ATCCCAGAGC ATCCATTTAT TCCATTGATC 
6784 TTTACTGACA TCTATTATCT GACCTACACA ATACTAGACA TTAGGACAAT GTGGCCTGCC 
6844 TCCAAGAAAC TCAAATAAGC CAACTGAGAT CAGAGAGGAT TAATCACCTG CCAATGGGCA 
6904 CAAAGCAACA AGCTGGGAGC CAAGTCCCAA AATGGGGCCT GCTGCTTCCA GTTCCCCTCT 
6964 CTCTGCATTG ATGTCAGCAT TATCCTTCGT CCCAGTCCTG TCTCCACTAC CACTTTCCCC 
7024 CTCAAACACA CACACACACA ACAGCCTTAG ATGTTTTCTC CACTGATAAG TAGGTGACTC 
7084 AATTTGTAAG TATATAATCC AAGACCTTCT ATTCCCAAGT AGAATTTATG TGCCTGCCTG 
7144 TGCTTTTCTA CCTGGATCAA GTGATGTCTA CAGAGTAGGG CAGTAGCTTC ATTCATGAAC 
7204 TCATTCAACA AGCATTATTC ACTGAGAGCC TTGTATTTTT CAGGCATAGT GCCAACAGCA 
7264 GTGTGGACAG TGGTGCATCA AAGCCTCTAG TCTCATAGAA CTTAGTCTTC TGGAGGATAT 
7324 GGAAAACAGA CAACCCAAAC AACCAACAAA AGAGCAAGAT GCTGCAAAAA AALLAAAAAT 
7384 GAATAGGGTG CTAAGATAGA GAAAAGTGGG AGAGTGCTAT TTAGACAAAG TGGTAAAAAC 
7444 AAAGCCCCTT GTGAGATGAG AGCTGCCGAC AGAGGGGGCG GGTCATGGTT GTGGGTTTTT 
7504 GGGTAGGACA TTCAGAGGAG GGGOCGGGTC GTGGTTGTGG GTTTTTGGGT AGGACATTCA 
7564 GAGGAGGGGG CGGOTCGTGG TTGTGGGTTT TTGGGTAGGA CATTCAGAGG AGGGGGCGGG 
7624 TCGTGGTTGT GGGTTTTTGG GTAGGACATT CAGAGGAGGG GGCGGGTCGT GGTTGTGGGT 
7684 TTTTGGGACA TTCAGAGGAG TCTGAATGCA CCCAGGCCTA CAACTTCAAG ATGGTAAAGG 
7744 ACAGCTCCAA GGATCAGAAG AAGCATTCTT GGAACTGGGG CATTTTGAGA AGGAGGAAAA 
7804 ATATGCAGAG ACTAGTGCTT GCAGAGCTTG CATTTGGATT TCATTTGAGG TACAATGAAA 
7864 ACCCATTAAT GGGTTTCACA CAGTGCAATG GCCTGACCTC ACTTATATTT CCTAAAATAG 
7924 AAAACAGATC AGAAGGAAGG CAATAGAGAA GCAGAAAGTC CAATGAGGAG GTTTCACAGC 
7984 AGTCATGGGG GTGGGGTAAG GAAAAGAAGT GGAAAGAAAC AGACAGAATT GGGTTATATT 
8044 TTGGAGATAG AACCAACAGA AGGAAGAGGA GAAACAACAT TTACTGAGAA GGGAAAAAGT 
8104 AGGAGAGGAA TAGGTTTGGG AAATAAATCC TGCTGACATT GGAAACCCCA AGGAAGCCTC 
8164 AAAAGTATAT TTACTTGCTT TAGATTTAAA AGAATAGGAA AGAAGCATCT CAACTTGGAA 
8224 TTTGAAATCT ATTTTTCCAT AAAAGTATTG TTAAATTCTA CTCATACTCA CAAGAAAAGT 
8284 ACATTCTAAA GAGTATATTG AAAGAGTTTA CTGATATACT TAGGAATTTT GTGTGTATGT 
8344 GTGTGTGTGT ATGTGTGTGT GTGTGTTTAA CCTTCAATTG TTGACTTAAA TACTGAGATA 
8404 AATGTCATCT AAATGCTAAA TTGATTTCCC AAAGGTATGA TTTGTTCACT TGGAGATCAA 
8464 AATGTTTAGG GGGCTTAGAA TCACTGTAGT GCTCAGATTT GATGCAAAAT GTCTTAGGCC 
8524 TATGTTGAAG GCAGGACAGA AACAATGTTT CCCTCCTACC TGCCTGGATA CAGTAAGATA 
8584 CTAGTGTCAC TGACAATCTT CATAACTAAT TTAGATCTCT CTCCAATCAA CTAAGGAAAT 
8644 CAACTCTTAT TAATAGACTG GGCCACACAT CTACTAGGCA TGTAATAAAT GCTTGCTGAA 
8704 TGAACAAATG AATGAAGAGC CTATAGCATC ATGTTACAGC CATAGTCCTA AAGTGGTGTT 
8764 TCTCATGAAG GCCAAATGCT AAGGGATTGA GCTTCAGTCC TTTTTCTAAC ATCTTGTTCT 
8824 CTAACAGAAT TCTCTTCTTT TCTTCATAGG AGATGCCTGA GATACCCAAA ACCATCACAG 
8884 GTAGTGAGAC CAACCTCCTC TTCTTCTGGG AAACTCACGG CACTAAGAAC TATTTCACAT 
8944 CAGTTGCCCA TCCAAACTTG TTTATTGCCA CAAAGCAAGA CTACTGGGTG TGCTTGGCAG 
9004 GGGGGCCACC CTCTATCACT GACTTTCAGA TACTGGAAAA CCAGGCGTAG GTCTGGAGTC 
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9064 TCACTTGTCT CACTTGTGCA GTGTTGACAG TTCATATGTA CCATGTACAT GAAGAAGCTA 

9124 AATCCTTTAC TGTTAGTCAT TTGCTGAGCA TOTACTGAGC CTTOTAATTC TAAATGAATG 

9184 TTTACACTCT TTGTAAQAG? QflMCCMCt CTAACATATA ATGTTGTTAT TTAAAQAACA 

9244 CCCTATATTT TGCATAQTAC CAATCATTTT AATTATTATT CTTCATAACA ATTTTAGGAG 

9304 GACCAGAGCT ACTGACTATG GCTACCAARA AGACTCTACC CATATTACAG ATGGGCAAAT 

9364 TAAGGCATAA GAAAACTAAG AAATATGCAC AATAGCAGTT GAAACAAGAA GCCACAGACC 

9424 TAGGATTTCA TGATTTCATT TCAACTGTTf GCCTTCTGCT TTTAAGTTGC TGATGAACTC 

9484 TTAATCAAAT AGCATAAGTT TCTGGGACCT CAGTTTTATC ATTTTCAAAA TGGAGGGAAT 

9544 AATACCTAAG CCTTCCTGCC GCAACAGTTT TTTATGCTAA TCAGGGAGGT CATTTTGGTA 

9604 AAATACTTCT CGAAGCCGAG CCTCAAGATG AAGGCAAAGC ACGAAATGTT ATTTTTTAAT 

9664 TATTATTTAT ATATGTATTT ATAAATATAT TTAAGATAAT TATAATATAC TATATTTATG 

9724 GGAACCCCTT CATCCTCTGA OTGTQACCAG GCATCCTCCA CAATAGCAGA CAGTGTTTTC 

9784 TGGGATAAGT AAGTTTGATT TCATTAATAC AGGGCATTTT GGTCCAAGTT GTGCTTATCC 

9844 CATAGCCAGG AAACTCTGCA TTCTAGTACT TGGGAGACCT GTAATCATA7 AATAAATGTA 

9904 CATTAATTAC CTTGAGCCAG TAATTGGTCC GATCTTTGAC TCTTTTGCCA TTAAACTTAC 

9964 CTGGGCATTC TTGTTTCATT CAATTCCACC TGCAATCAAG TCCTACAAGC TAAAATTAGA 

10024 TGAACTCAAC TTTGACAACC ATGAGACCAC TGTTATCAAA ACTTTCTTTT CTGGAATGTA 

10084 ATCAATGTTT CTTCTAGGTT CTAAAAATTG TGATCAGACC ATAATGTTAC ATTATTATCA 

10144 ACAATAGTGA TTGATAGAGT GTTATCAGTC ATAACTAAAT AAAGCTTGCA ACAAAATTCT 

10204 CTGACACATA GTTATTCATT GCCTTAATCA TTATTTTACT GCATGGTAAT TAGGGACAAA 

10264 TGGTAAATGT TTACATAAAT AATTGTATTT AGTGTTACTT TATAAAATCA AACCAAGATT 

10324 TTATATTTTT TTCTCCTCTT TGTTAflCTGC CAGTATGCAT AAATGGCATT AAGAATGATA 

10384 ATATTTCCGG GTTCACTTAA AGCTCATATT ACACATACAC AAAACATGTG TTCCCATCTT 

10444 TATACAAACT CACACATACA GAGCTACATT AAAAACAACT AATAGGCCAG GCACGG7GGC 

10504 TCAGACCTGT AATCCCAGCA CTTTGGGAGG 
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DNA Sequence of the human 1MB gene. (GenBank Accession No. X04500) 

1933 AGAAAGAAAG AGAGAGAGAA KUUIUUU (UfifiUOTU GGAAGGAAGG AAGAAAGACA 
1873 G6CTCTGAGG AAGGTGGCAG TTCCTACAAC GGGAGAACCA GTGGTTAATT TGCAAAGTGG 
1813 ATCCTGTGGA GGCAWCAGA GGAGTCCCCT AGGCCACCCA GACAGGGCTT TTAGCTATcT 
1753 GCAGGCCAGA CACCAAATTT CAGGAGGGCT CAGTGTTAGG AATGGATTAT GGCTTATCAA 
1693 ATTCACAGGA AACTAACATG TTGAACAGCT TTTAGATTTC CTGTGGAAAA TATAACTTAC 
1633 TAAAGATGGA GTTCTTGTGA C7GACTCCTG ATATCAAGAT ACTGGGAGCC AAATTAAAAA 
1573 TCAGAAGGCT GCTTGGAGAG CAAGTCCATG AAATGCTCTT TTTCCCACAG TAGAACCTAT 
1513 TTCCCTCGTG TCTCAAATAC TTGCACAGAG GCTCACTCCC TTGGATAATG CAGAGCGAGC 
1453 ACGATACCTG GCACATACTA ATTTGAATAA AATGCTGTCA AATTCCCATT CACCCATTCA 
1393 AGCAGCAAAC TCTATCTCAC CTGAATG7AC ATGCCAGGCA CTGTGCTAGA CTTGGCTCAA 
1333 AAAGATTTCA GTTTCCTGGA GGAACCAGGA GGGCAAGGTT TCAACTCAGT GCTATAAGAA 
1273 GTGTTACAGG CTGGACACGG TGGCTCACGC CTGTAATCCC AACATTTGGG AGGCCGAGGC 
1213 GGGCAGATCA CAAGGTCAGG AGATCOAGAC CATCCTGGCT AACATGGTGA AACCCTGTCT 
1153 CTACTAAAAA TACAAAAAAT TAGCCGGGCG TTGGCGGCAG GTGCCTGTAG TCCCAGCTGC 
1093 TGGGGAGGCT GAGGCAGGAG AATGGTGTGA ACCCGGGAGG CGGAACTTGC AGGGGGCCGA 
1033 GATCGTGCCA CTGCACTCCA GCCTGGGCGA CAGAGTGAGA CTCTGTCTCA AAAAAAAAAA 
-973 AAAAGTGTTA TGATGCAGAC CTOTCAAAGA GGCAAAGGAG GGTGTTCCTA CACTCCAGGC 
-913 ACTGTTCATA ACCTGGACTC TCATTCATTC 7ACAAATGGA GGGCTCCCCT GGGCAGATCC 
-853 CTGGAGCAGG CACTTTGCTG GTGTCTCGGT TAAAOAGAAA CTGATAACTC TTGGTATTAC 
-793 CAAGAGATAG AGTCTCAGAT GGATATTCTT ACAGAAACAA TATTCCCACT TTTCAGAGTT 
-733 CACCAAAAAA TCATTTTAGG CAGAGCTCAT CTGGCATTGA TCTGGTTCAT CCATGAGATT 
-673 GGCTAGGGTA ACAGCACCTG GTCTTGCAGG GTTGTGTGAG CTTATCTCCA GGGTTGCCCC 
-613 AACTCCGTCA GGAGCCTGAA CCCTGCA7AC CGTATGTTCT CTGCCCCAGC CAAGAAAGG? 
-553 CAATTTTCTC CTCAGAGGCT CCTGCAATTG ACAGAGAGCT CCCGAGGCAG AGAACAGCAC 
-493 CCAAGGTAGA GACCCACACC CTCAATACAG ACAGGGAGGG CTATTGGCCC TTCATTGTAC 
-433 CCATTTATCC ATCTGTAAGT GGGAAGATTC CTAAACTTAA GTACAAAGAA GTGAATGAAG 
-373 AAAAGTATGT GCATGTATAA ATCTGTGTGT CTTCCACTTT GTCCCACATA TACTAAATTT 
-313 AAACATTCTT CTAACGTGGG AAAATCCAGT ATTTTAATGT GGACATCAAC TGCACAACGA 
-253 TTGTCAGGAA AACAATGCAT ATTTGCATGG TGATACATTT GCAAAATGTG TCATAGTTTG 
-193 CTACTCCTTG CCCTTCCATG AACCAGAGAA TTATCTCAGT TTATTAGTCC CCTCCCCTAA 
-133 GAAGCTTCCA CCAATACTCT TTTCCCCTTT CCTTTAACTT GATTGTGAAA TCAGGTATTC 
-73 AACAGAGAAA TTTCTCAGCC TCCTACTTCT GCTTTTGAAA GCTATAAAAA CAGCGAGGGA 
-13 GAAACTGGCA GATACCAAAC CTC7TCGAGG CACAAGGCAC AACAGGCTGC TCTGGGATTC 
48 TCTTCAGCCA ATCTTCATTG CTCAAGTATG ACTTTAATCT TCCTTACAAC TAGGTGCTAA 
108 GGGAGTCTCT CTGTCTCTCT GCCTCTTTGT GTGTATGCAT ATTCTCTCTC TCTCTCTCTT 
168 TCTTTCTCTG TCTCTCCTCT CCTTCCTCTC TGCCTCCTCT CTCAGCTTTT TGCAAAAATG 
228 CCAGGTGTAA TATAATGCTT ATGACTCGGG AAATATTCTG GGAATGGATA CTGCTTATCI 
288 AACAGCTGAC ACCCTAAAGG TTAGTGTCAA AGCCTCTGCT CCAGCTCTCC TAGCCAATAC 
238 ATTGCTAGTT GGGGTTTGGT TTAGCAAATG CTTTTCTCTA GACCCAAAGG ACTTCTCTTT 
308 CACACATTCA TTCATTTACT CAGAGATCAT TTCTTTGCAT GACTGCCATG CACTGGATGC 
468 TGAGAGAAAT CACACATGAA CGTAGCCGTC ATGGGGAAGT CACTCATTTT CTCCTTTTTA 
528 CACAGGTGTC TGAAGCAGCC ATGGCAGAAG TACCTGAGCT CGCCAGTGAA ATGATGGCT7 
588 ATTACAGGTC AGTGGAGACG CTGAGACCAG TAACATGAGC AGGTCTCCTC TTTCAAGAGT 
648 AGAGTGTTAT CTGTGCTTGG AGACCAGATT TTTCCCCTAA ATTGCCTCTT TCAGTGGCAA 
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708 ACAGGGTGCC AAGTAAATCT GATTTAAAGA 
768 TGGGACCTGG AGGCTATCCA GATGTGTTGT 
828 AGAAAAGATT CCAAGCCCAC AATACAAGGA 
888 AGGGAGAGCT CAGATTTTAG CTGACTCTGC 
948 AGGGAGCACC AGGGTGCCCA CCTGCCAGGC 
1008 ATATCACTGT GAAACTTGCC AGGTGTTTCA 
1068 TTCTTATGAA CAGTCAAGTC TTGTACACAG 
1128 TAATTGAGAC ATGTCCCTGT TTTTATTACA 
1188 CTGATGGCCC TAAACAGATG AAGGTAAGAC 
1248 CTCTAACACA GGGAAAGCTC AAAGAAGGGA 
1308 TGTTTTAGAA AGACTTTAAC CTCTTCCAGT 
1368 GCCACTTGGT CATCATATCA CCACAGTCAC 
1428 GGTGGTGACA GGGGAG6AGT AGTGATAATG 
1488 TCTTCAACAT AAATTTGATT ATCCTTTTAA 
1548 AGTTAAACTC TGAAACCAAG AGATGATCTT 
1608 TAGAATAGTT TTTTGCTACC TGGGGTGAAG 
1668 CAAAAAGATG AATTGAGACT TGAAAGAAAA 
1728 CATTTTACCC GCTTTGGACC TCATCTGAAA 
1788 ATTCCAGCAT CCTGCAACCT CCAGTTCTGA 
1848 GGCAGCAAAT GGTCATTTTT CAGACTCATC 
1908 TCCCTTCTGT TTTATATGAT GCTCAGTAGC 
1968 AGGTCAGTTG TGCAGGTTGG AGGCAGCCAC 
2028 TGATAGCCTC CCCTAGCCTC ATAATCCAGT 
2088 GAAGTTTTAA GACTGGCA7A ACTTCTTGGC 
2148 CTGCCTGGCC TTTGCCCCCC ATTGCCTCTT 
2208 ATTCTCTCCT GCTCAACTTT CTTTTGTGCA 
2268 CTTTGAACCA TTATCAGCCT TAAGGCAACC 
2328 AATAAGAGGA AAGTGGTAAC ATTTCACAAA 
2388 ATGAGACAGT GTTATGAAAA AGGAAAAAAA 
2448 GAGTGAGCAT AGGTGAATGG AAAATGTTAT 
2508 GTGACAGCAT TAGGGATACA AAAAGATATA 
2568 GCATGTACAA AAAGATGACA AGTAGAATCG 
2628 GTCCAGAAGC CACATTCTAG TCTTGAGTCT 
2688 CCCTTAACCT CCTTGAGCTT CAGAGAGGGA 
2748 GTTTTGTTTT GTTTTGTTTT GTTTTATGAG 
2808 GTGCAGTGGT ACAATCTTGG CTTACTGCAT 
2868 TTCCTCAGTC TCCTGAATAG CTAGGATTAC 
2928 TGTATTTTTA GTAGAGAAGG GGTTTCGCCA 
2988 CTAAATGATT CATCCACCTC GGCTTCCCAA 
3048 GCCTGGCCCA GAGAGGGATG ATCTTTAGAA 
3108 CTCTGAGCTT TCTACTCTCT GATGTCAAAG 
3168 CTCCCTCCCT CGCTCTCTCC GCAGTGCTCC 
3228 GGCGGCATCC AGCTACGAAT CTCCGACCAC 
3288 TCAGTTGTTG TGGCCATGGA CAAGCTGAGG 



CTACTTTCCC ATTACAAGTC CCTCCAGCCT 
7QCAAGGGCT TCCTGCAGAG GCAAATGGGG 
ATCCCTTTGC AAAGTGTGGC TTGGAGGGAC 
TGGGCTAGAG GTTAGGCCTC AAGATCCAAC 
CTAGAATCTG CCTTCTGGAC TGTTCTGCGC 
GGCAGCTTTG AGAGGCAGGC TGTTTGCAGT 
GGAAGGAAAA ATAAACCTGT TTAGAAGACA 
GTGGCAATGA GGATGACTTG TTCTTTGAAG 
TATGGGTTTA ACTCCCAACC CAAGGAAGGG 
GTTCTGGGCC ACTTTGATGC CATGGTATTT 
GAGACACAGG CTGCACCACT TGCTGACCTG 
TCACTAACGT TGGTGGTGGT GGCCACACTT 
TTCCCATTTC A7AGTAGGAA GACAACCAAG 
GAOATGOATT CAGCCTATGC CAATCACTTG 
GAGAACTAAC ATATGTCTAC CCCTTTTGAG 
CTTATAACAA CAAGACATAG ATGATATAAA 
CCATTCACTT GCTGTTTGAC CTTGACAAGT 
AATAAAGGGC TGAGCTGGAT GATCTCTGAG 
AATATTTTCA GTTGTAGCTA AGGGCATTTG 
CTTACAAAGA GCCATGTTAT ATTCCTGCTG 
CTTCCTAGGT GCCCAGCCAT CAGCCTAGCT 
TTTTCTCTGG CTTTATTTTA TTCCAGTTTG 
CCTCAATCTT GTTAAAAACA TATTTCTTTA 
TGCAGCTGTG GGAGGAGCCC ATTGGCTTGT 
CCAGCAGCTT GGCTCTGCTC CAGGCAGGAA 
CTTACAGGTC TCTTTAACTG TCTTTCAAOC 
TCA6TGAAGC CTTAATACGG AGCTTCTCTG 
AAGTACTCTC ACAGGATTTG CAGAATGCCT 
AGAACAGTGT AGAAAAATTG AATACTTGCT 
GGTCATCTGC ATGAAAAAGC AAATCATAGT 
GAGAAGGTAT ACATGTATGG TGTAGGTGGG 
GGATTTATTC TAAAGAATAG CCTGTAAGGT 
GCCTCTACCT GCTGTGTGCC CTTGAGTACA 
TAATCTTTTT ATTTTATTTT ATTTTATTTT 
ACAGAGTCTC ACTCTGTTGC CCAGGCTGGA 
CCTCCACCTC CTGAGTTCAA GCGATTCTCC 
AGGTGCACCC CACCACACCC AGCTAATTTT 
TGTTGGCCAG GCTGGTTTTG AAGTCCTGAC 
AGTGCTGGGA TTACAGGCAT GAGCCACCAC 
GCTCGGGATT CTTTCAAGCC CTTTCCTCCT 
CATGGTTCCT GGCAGGACCA CCTCACCAGG 
TTCCAGGACC TGGACCTCTG CCCTCTGGAT 
CACTACAGCA AGGGCTTCAG GCAGGCCGCG 
AAGATGCTGG TTCCCTGCCC ACAGACCTTC 



Fig, 4 continued 

SUBSTITUTE SHEET (RULE 26) 





WO 01/00880 



PCT/US00/18318 



10/16 



3348 CAGGAGAATG ACCTGAGCAC CTTCTTTCCC TTCATCTTTG AAGAAGGTAG TTAGCCAAGA 
3408 GCAGGCAGTA GATCTCCACT TGTQTCCTCT TGGAAGTCAT CAAGCCCCAQ CCAACTCAAT 
3468 TCCCCCAGAG CCAAAGCCCT TTAAAGGTAG AA6GCCCAGC GGGGAQACAA AACAAAGAAG 
3528 GCTGGAAACC AAAGCAATCA TCTCTTTAGT GGAAACTATT CTTAAAGAAG ATCTTGATGG 
3588 CTACTGACAT TTGCAACTCC CTCACTCTTT CTCAGGGGCC TTTCACTTAC ATTGTCACCA 
3648 GAGGTTCGTA ACCTCCCTGT GGGCTAGTGT TATGACCATC ACCATTTTAC CTAAGTAGCT 
3708 CTGTTGCTCG GCCACAGTGA GCAGTAATAG ACCTGAAGCT GGAACCCATG TCTAATAGTG 
3768 TCAGGTCCAG TGTTCTTAGC CACCCCACTC CCAGCTTCAT CCCTACTGGT GTTGTCATCA 
3828 GACTTTGACC GTATATGCTC AGGTGTCCTC CAAGAAATCA AATTTTGCCA CCTCGCCTCA 
3888 CGAGGCCTGC CCTTCTGATT TTATACCTAA ACAACATGTG CTCCACATTT CAGAACCTAT 
3948 CTTCTTCGAC ACATGGGATA ACGAGGCTTA TGTGCACGAT GCACCTGTAC GATCACTGAA 
4008 CTGCACGCTC CGGGACTCAC AGCAAAAAAG CTTGGTGATG TCTGGTCCAT ATGAACTGAA 
4068 AGCTCTCCAC CTCCAGGGAC AGGATATGGA GCAACAAGGT AAATGGAAAC ATCCTGGTTT 
4128 CCCTGCCTGG CCTCCTGGCA GCTTGCTAAT TCTCCATGTT TTAAACAAAG TAGAAAGTTA 
4188 ATTTAAGGCA AATGATCAAC ACAAGTGAAA AAAAATATTA AAAAGGAATA TACAAACTTT 
4248 GGTCCTAGAA ATGGCACATT TGATTGCACT GGCCAGTGCA TTTGTTAACA GGAGTGTGAC 
4308 CCTGAGAAAT TAGACGGCTC AAGCACTCCC AGGACCATGT CCACCCAAGT CTCTTGGGCA 
4368 TAGTGCAGTG TCAATTCTTC CACAATATGG GGTCATTTGA TGGACATGGC CTAACTGCCT 
4428 GTGGGTTCTC TCTTCCTGTT GTTGAGGCTG AAACAAGAGT GCTGGAGCGA TAATGTGTCC 
4488 ATCCCCCTCC CCAGTCTTCC CCCCTTGCCC CAACATCCGT CCCACCCAAT GCCAGGTGGT 
4548 TCCTTGTAGG GAAATTTTAC CGCCCAGCAG GAACTTATAT CTCTCCGCTG TAACGGGCAA 
4608 AAGTTTCAAG TGCGGTGAAC CCATCATTAG CTGTGGTOAT CTGCCTGGCA TCGTGCCACA 
4668 GTAGCCAAAG CCTCTGCACA GGAGTGTGGG CAACTAAGGC TGCTGACTTT GAAGGACAGC 
4728 CTCACTCAGG GGGAAGCTAT TTGCTCTCAG CCAGGCCAAG AAAATCCTGT TTCTTTGGAA 
4788 TCGGGTAGTA AGAGTGATCC CAGGGCCTCC AATTGACACT GCTGTGACTG AGGAAOATCA 
4848 AAATGAGTGT CTCTCTTTGG AGCCACTTTC CCAGCTCAGC CTCTCCTCTC CCAGTTTCTT 
4908 CCCATGGGCT ACTCTCTGTT CCTGAAACAG TTCTGGTGCC TGATTTCTGG CAGAAGTACA 
4968 GCTTCACCTC TTTCCTTTCC TTCCACATTG ATCAAGTTGT TCCGCTCCTG TGGATGGGCA 
5028 CATTGCCAGC CAGTGACACA ATGGCTTCCT TCCTTCCTTC CTTCAGCATT TAAAATGTAG 
5088 ACCCTCTTTC ATTCTCCGTT CCTACTGCTA TGAGGCTCTG AGAAACCCTC AGGCCTTTGA 
5148 GGGGAAACCC TAAATCAACA AAATGACCCT GCTATTGTCT GTGAGAAGTC AAGTTATCCT 
5208 GTGTCTTAGG CCAAGGAACC TCACTGTGGG TTCCCACAGA GGCTACCAAT TACATGTATC 
5268 CTACTCTCGG GGCTAGGGGT 7GGGGTGACC CTGCATGCTG TGTCCCTAAC CACAAGACCC 
5328 CCTTCTTTCT TCAGTGGTGT TCTCCATGTC CTTTGTACAA GGAGAAGAAA GTAATGACAA 
5388 AATACCTGTG GCCTTGGGCC TCAAGGAAAA GAATCTGTAC CTGTCCTGCG TGTTGAAAGA 
5448 TGATAAGCCC ACTCTACAGC TGGAGGTAAG TGAATGCTAT GGAATGAAGC CCTTCTCAGC 
5508 CTCCTGCTAC CACTTATTCC CAGACAATTC ACCTTCTCCC CGCCCCCATC CCTAGGAAAA 
5568 GCTGGGAACA GGTCTATTTG ACAAGTTTTG CATTAATGTA AATAAATTTA ACATAATTTT 
5628 TAACTGCGTG CAACCTTCAA TCCTGCTGCA GAAAATTAAA TCATTTTGCC GATGTTATTA 
5688 TGTCCTACCA TAGTTACAAC CCCAACAGAT TATATATTGT TAGGGCTGCT CTCATTTGAT 
5748 AGACACCTTG GGAAATAGAT GACTTAAAGG GTCCCATTAT CACGTCCACT CCACTCCCAA 
5808 AATCACCACC ACTATCACCT CCAGCTTTCT CAGCAAAAGC TTCATTTCCA AGTTGATGTC 
5868 ATTCTAGGAC CATAAGGAAA AATACAATAA AAAGCCCCTG GAAACTAGGT ACTTCAAGAA 
5928 GCTCTAGCTT AATTTTCACC CCCCCAAAAA AAAAAAATTC TCACCTACAT TATGCTCCTC 
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5988 AGCATTTGGC ACTAAGTTTT AGMMQAAO AAGGGCTCTT TTAATAATCA CACAGAAAGT 
6048 TGGGGGCCCA GTTACAACTC AGGAGTCTGG CTCCTGATCA TGTGACCTGC TCGTCAGTTT 
6108 CCTTTCTGGC CAACCCAAAG AACATCTTTC CCATAGGCAT CTTTGTCCCT TGCCCCACAA 
6168 AAATTCTTCT TTCTCTTTCG CTGCAGAGTG TAGATCCCAA AAATTACCCA AAGAAGAAGA 
6228 TGGAAAAGCG ATTTGTCTTC AACAAGATAG AAATCAATAA CAAGCTGGAA TTTGAGTCTG 
6288 CCCAGTTCCC CAACTGGTAC ATCAGCACCT CTCAAGCAGA AAACATGCCC GTCTTCCTGG 
6348 GAGGGACCAA AGGCGGCCAG GATATAACTG ACTTCACCAT GCAATTTGTG TCTTCCTAAA 
6408 GAGAGCTGTA CCCAGAGAGT CCTGTGCTGA ATGTGGACTC AATCCCTAGG GCTGGCAGAA 
6468 AGGGAACAGA AAGGTTTTTG AGTACGGCTA TAGCCTGGAC TTTCCTGTTG TCTACACCAA 
6528 TGCCCAACTG CCTGCCTTAG GGTAGTGCTA AGAGGATCTC CTGTCCATCA GCCAGGACAG 
6588 TCAGCTCTCT CCTTTCAGGG CCAATCCCCA GCCCTTTTGT TGAGCCAGGC CTCTCTCACC 
6648 TCTCCTACTC ACTTAAAGCC CGCCTGACAG AAACCACGGC CACATTTGGT TCTAAGAAAC 
6708 CCTCTGTCAT TCGCTCCCAC ATTCTGATGA GCAACCGCTT CCCTATTTAT TTATTTATTT 
6768 GTTTGTTTGT TTTGATTCAT TGGTCTAATT TATTCAAAGG GGGCAAGAAG TAGCAGTGTC 
6828 TGTAAAAGAG CCTAGTTTTT AATAGCTATG GAATCAATTC AATTTGGACT GGTGTGCTCT 
6888 CTTTAAATCA AGTCCTTTAA TTAAGACTGA AAATATATAA GCTCAGATTA TTTAAATGGG 
6948 AATATTTATA AATGAGCAAA TATCATACTO TTCAATGGTT CTGAAATAAA CTTCACTGAA 
7008 GAAAAAAAAA AAAGGGTCTC TCCTGATCAT TGACTGTCTG GATTGACACT GACAGTAAGC 
7068 AAACAGGCTG TGAGAGTTCT TGGGACTAAG CCCACTCCTC ATTGCTGAGT GCTGCAAGTA 
7128 CCTAGAAATA TCCTTGGCCA CCGAAGACTA TCCTCCTCAC CCATCCCCTT TATTTCGTTG 
7188 TTCAACAGAA GGATATTCAG TGCACATCTG GAACAGGATC AGCTGAAGCA CTGCAGGGAG 
7248 TCAGGACTGG TAGTAACAGC TACCATGATT TATCTATCAA TGCACCAAAC ATCTGTTGAG 
7308 CAAGCGCTAT GTACTAGGAG CTGGGAGTAC AGAGATGAGA ACAGTCACAA GTCCCTCCTC 
7368 AGATAGGAGA GGCAGCTAGT TATAAGCAGA ACAAGGTAAC ATGACAAGTA GAGTAAGATA 
7428 GAAGAACGAA GAGGAGTAGC C*GQUGG*Q GGAGGAGAAC GACATAAGAA TCAAGCCTAA 
7488 AGGGATAAAC AGAAGATTTC CACACATGGG CTGGGCCAAT TGGGTGTCGG TTACGCCTGT 
7548 AATCCCAGCA CTTTGGGTGG CAGGGGCAGA AAGATCGCTT GAGCCCAGGA GTTCAAGACC 
7608 AGCCTGGGCA ACATAGTGAG ACTCCCATCT CTACAAAAAA TAAATAAATA AATAAAACAA 
7668 TCAGCCAGGC ATGCTGGCAT GCACCTGTAG TCCTAGCTAC TTGGGAAGCT GACACTGGAG 
7728 GATTGCTTGA GCCCAGAAGT TCAAGACTGC AGTGAGCTTA TCCGTTGACC TGCAGGTCGA 
7788 C 
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DNA Sequence of the human IL-1RN gene. IGenBank Accession No. X64532) 
-5988 GTCGACCTGC AGGTCAACGG ATCTOIGAGG 10AGT1GCTT CTTGTAGATA ACAGTTGGAT 
-5928 TATATACCAT QTCCTGATCC CCTTCATCAT CCAfiOASAGC AGAGGTGQTC ACCCTGATA6 
-5868 CAGCAAGCCT 6GGGGCT6CA GCTTGGTGGG TAGAGGTACT CAGGGGTACA GATGTCTCCA 
-5808 AACCTGTCCT GCTGCCTTAG GGAGCTTCTA ATAAGTTGAT GGATTTGGTT AAAATTAACT 
-5748 TGGCTACTTG GCAGGACTG6 GTC&G7GAGG ACCAACAAAA AGAAGACATC AGATTATACC 
-5688 CTGGGGGTTT GTATTTCTTG TGTTTCTTTC TCTTCTTTGT ACTAAAATAT TTACCCATGA 
-5628 CTGGGAAAGA GCAACTGGAG TCTTTGTAGC ATTATCTTAG CAAAAATTTA CAAAGTTTGG 
-5568 AAAACAATAT TGCCCATATT GTGTGGTGTG TCCTGTGACA CTCAGGATTC AAGTGTTGGC 
-5508 CGAAGCCACT AAATGTGAGA TGAAGCCATT ACAAGGCAGT GTGCACATCT GTCCACCCAA 
-5448 GCTGGATGCC AACATTTCAC AAATAGTGCT TGCGTGACAC AAATGCAGTT CCAGGAGGCC 
-5388 CAAATGAAAA TGTTTGTACT GAAATTTGTT AAAGCTTCCC GACAAACTAG ATTTATCAGT 
-5328 AAGGATTGTT TTCTGCAAGG GGGATGAAAC TTGTGGGGTG AGCCATTTGG GCTGAGGAGG 
-5268 AGGGAGGTTG GAGCTGAGAA ATGTGGAGAC AATTTCCCTT TAGAAGGACT GAATCTCCCT 
-5208 GCCTCTCTGG GGTGCGGCAG CCAGCAGGAT CCAATGGTGT ATATGTCTCC CCAGCTCCCC 
-5148 ATTCAGTGAT ATCATGTCAG TAGCTTOJUU TTATCCGTGG TGGGAGTATT ATGTCATGGA 
-5088 AATTGGCAAA TGGAAACTTT TA7TGGAGAT TCAATTGTTA AACTTTTACC AGCACAACAC 
-5028 TGCCCTGCCT TCAGAGTCAA TOACCCTATC CAAG7TTAAT CCATCTGTCC ACTGTCTCCA 
-4968 ACACGATCTT TATAAAACAC ACCTGACAAC ATTACCCTTT TATTCAGTTT TTTAAAAGAT 
-4908 AAGTTTCCAG CTCATCGGGG TGGCTTTAU GGCCATTTCT CCTCTGGACC TCACCCAACT 
-4848 TTTCAAATCA CTTTTCCTAC CCCTACCTCT AAATGCTACT CAAACTCCAG CCATCCTGAA 
-4788 TAATAAGACT TTTGAAAAGT AGATTATGGG CTGGGCACAG TGGCTCACAC CTGTAATCCC 
-4728 AGCACTTTGG GAGGCCAAGA TGGGTGGATC ACCTGAGGTC GGGAGTTCGA GACCAGCCTG 
-4668 ACTAACATAG TGAAACCCTG TCTCTACTAA AAATACAAAA TTAGTTGGGG GTGGTGGCAC 
-4608 AAGCCTGTAA TCCCAGCTAC TCAGGAGGTT GAGGCAGGGG AATTGCTTGA ACCTGQGAGG 
-4548 CGGAGGTTGC GGTGAGCCTA GATTGCTCCA CTGCACTCCA GCCTGGGCAA CAAGAGCGAA 
-4488 ACTCCATCTC AAAAAAATAA ATAAATAAAT AAAGTAGATT ACATCAGATA CCTCTOGCCT 
-4428 AGGTTGTTTA TGACCAACTC TCCTGCTGAG AATAACTAGA AAAGCTAGAC AAAACATATT 
-4368 TCCAAAAGAT CTCTTTGGAG GCATCAGAGA ATGGCCAAGG CTGTAAGGAA CTGCCTGAGC 
-4308 CCAGAGAGGT GGAGCCCAGC ACTGGTGCCC TTTACTCCTG GGGACATGTG CTGGTTTCAA 
-4248 AAACTTCAGC TGAGCTTTTG AGCATTCATG GAACTTGGTG GGGGAGATGA AATTTGTACC 
-4188 TTAAATCCTG CCTACAGGGA GGGTCCCTGA TAATCCCCAC CCAATTTGGA AATCTGGG7C 
-4128 AGCCTTCACA GGTACTGAAG CCCTCCTCTG AATGATCTCA AGTCCTGCTA GGGTAOAGGT 
-4068 TACCTGCTTT TGAAAGGCTC CTGGCCTACC TGTGCAGCAG GAGCAAAAGT GAACCATCTC 
-4008 AGGGTACAGA TAACAATCAT CCAGAGCCTT GAATGACCTC TACTGTGCTT AATATATAGT 
-3948 ATTCAGCAGT CAGTAAAAAG GATTTAGGCA CATGCAAGAT GACCTGTGTA TCAGGGAGAA 
-3888 ATAGGCAATA AATTGAGATC CAGCAGGGAT TTGAATCATG GATTTGAATC AGGGGCAGCC 
-3828 TTCGAAAGAA CTATGGAGAA TATACTCAGA TTTAAAACAT AAGATTGGAA TTTTTGGCAG 
-3768 AGAACTAACA ACTGTACAAA AAAGGAACCA AATGGAAATC CTAGAACTGA AAGATGCAAT 
-3708 TAACCGATGT TGAGAAATAG CCAACATCTA TTGAACACTT CCCATGTGGA CAGCTGTGCT 
-3648 AAACACTTTA CAGGCATCAA CATAAGATGT GTCCCCTTAC AGCAGTGCAG TGTCCCTCCT 
-3588 AAGACATGGA CAGCCTGGTT TCCCTATCTC TCTGCTTCAT CAAAACCCCT TTACGTGGGG 
-3528 CTTAGACACT CCTGTTGTCT CTAGTGTCTA GTAGCACAGG GCTCAGCACA TGGAAGCCAC 
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3468 TAGATACAA? TTGATGACCA QG&CCTCCGA TGAAAGCCAT GGGTGCTGAT TGGGAAGGCA 
3408 TTGTCTTTTA TGTGCTATGG TCTTAAAGCT TCATCCAGGA AflCAGAACTC GGGOOQTGCT 
3348 GAGGACCCAO AACCGAGAAT AAGATTAGTC AGAGATTTCC TGTGGGCAGA AATCATAAGG 
3288 ACGCCAACTG TTTGGGTGAG ATAAGACGAA ACCAAGAGTG GACTTGTGGC CAGAAGCGTG 
3228 AGGAAGAGGG AGAGAGCTTC CCTTGTCCCC TTTCTTCCTC TCCCTAAGCC ACAGTGATTG 
3168 ACAGCCCCCC CGCTTTGGAG TCAGAGCAGG CTTGAGACTG GACTGGGAAA GGAGGGTGGG 
3108 TCAGGATACA GAGCAGGAAG GCTGGGAGTG CAGGGCAGGA GCAAGGGGCT GGGGCATTCA 
3048 TTGTGCCTGA TCTCTCCCAC TTTACCTGGG GTAAAGAAGC ATATGCAAAA GCCACGGTGT 
2988 GAGTATTTCC CAAGTGCCAG GGTCAGGGCA TGATTCATCA CGTGCAGCAT TTCATTCAAT 
2928 CCTTATAGTA ACCGATGATG TGGCTTCTAT TATTAGCTCT ATCAGATAAT GAAACTGAGA 
2868 CCAAGACAGG CTCTGCACAT TGTGTGGGGT AATGACACAG GGGGATTCAG ACCTAGACTC 
2808 CATAACTCCT GCCCCAGGGA CCACCCCCAC CCTCACCCTG TGCATGTCGA CAAAGGACAG 
2748 ACTGGGCCAC TTCTCAGGAC ACAGCGGGG* AATGACACAG AGCAGGGAGG TTCCAGGAGC 
2688 CCCGAGCGTC TTTTCTCCAG GAGUTACTC TCTGAATTCA GACTGGGGTC AGAGAAACAT 
2628 TTACCCAGGA GCCGCAGTGT GGGTGGGGCT TTTTACTTGA AACGCTGTCT GAAGGCAGTG 
2568 GCAGGATGAA CTCTCCACCC TACCTTGGCA AGCCACTTCT CTTCTGCAAT CTGTAAGGAC 
2508 ATTGTTGAGA GAATTATGGT CTTCCAATTC CGGAGGGTTG AAGAAAGACA AATAGGAGAG 
2448 AACCTATCAT AGTCAGGTGC TAGCTGCCTT CTCTTTCAGA GAGTGTGAGA ATAAAGTGAT 
2388 ACACTTGATT ATTAGCAAAT ACTfTGGAAA TTTTAAACGC TAATATTCAA CACACTCTGG 
2328 AAGAGGCAAA TAAGTAGACA GGTTCATATA CATCATCTCC TTCAGCTAGT CCTCACAAU 
2268 ACAAACAAAT GAATAAACAA AATTCTTCTT TGGCCCTCAT AGGAAGACAC TGTTTCTTGA 
2208 ACGTGTTTCA AAAAGGATGG GTGACTCACT CAAGGTCACA CTGTTTATGA GGACAGTACA 
2148 GGAATACAGA CATGCCATTT TGCCTGAAAA AATCCATCAC CCAGGGAGGT GACACAATTT 
2088 TGCAGAAATG TTCTATTTCC TCTGAAGGAT ACATTCTTTA AACCTTTGGG AAATTCAWC 
2028 ATAGTCTTCC TCCTTTOAAO GATTACTCTC TGGACACAAA GTGTTTGATT CTGATTTGTT 
1968 GGTTGGAAGA TGTGTTGGTT GAGAGAAAGA TTCTGATTTG TTGGTTGAAA ATAGACTCAT 
1908 CAAGATCAAC TGCTGTAGTA GTAAATATTT TGACATTTTG TCTGTATTCC TGTGCTGCCC 
1848 TCACAAGCTG CATCACCTTG AGTGAGTCAT TCATACTTTT TTGTTTGTTT TTGTTTTGGA 
1788 GATGGAGTCT TACTCTGTTG CCTAGGCTGG AGfGCGGTGG CGTGATCTTG GCTCACTGCG 
1728 ACCTCCATCT CCTGGGTTCA AGTGATCCTC CTGCCTCAGC CTCCCGAGTA GCTGGGATTA 
1668 CAGGCACATG CCACCATCCC TGCTAATTTT TGCATTTTCA GTAGAGACGG AGTTTCACCA 
1608 TGTTGGTCAG GTTGOTCTTO AACTCCTGAC CTCAGGTGAT CCGCCCACCT CAGCCTCCCC 
1548 AAGTGCTGGG ATTACAOGTG TGAGCCACCG TGCCCAGCCC AGCCATCATT TTTGAAACAC 
1488 GTTTGAGAAA TAGTGTCTTC CTTTOAGGGC CAAGGAGACA TTTTTTTTGT TTATTTGTTT 
1428 GTTTTTGTGA GGACTAGCTG AAGGGGGTGA TGTATATTAA CCTGCCTACT TATTTGCCTC 
1368 TTCCCAGAGT GTGATGAATA TTAGGGTTTA AAGTTTCTGA AGCATTTGTT AATAAAGCCC 
1308 GGGGCTGGAG GTCAGAAGAC CTGGATTTCT CTGCATACTT TTGCCATCAG CAAGCTGTGT 
1248 GACCTTGGAC AGATCCCTTT TTTGTCTAAA TCTTTCTGAG TCTTCTTGAA AACAATGCCA 
1188 GGTTGGGACA GGATGATTGC CAAGCTCCCG TCCAGCTCTA AAACACTGCA ACGTATGCT7 
1128 CTGCACCAGC ACTGTCCATC CTGTAGATCA TGCAGAAATT CTCTTCAACT TTTTCCTACC 
1068 CATAAAATAG GAGCATGCTT ACCTTTTTCC TAATGTTCCA GGCCCCGGGT CTAGATATTG 
1008 TAAGTAAGGA AGTTAATGTG TATCAGAGCC CATTATGGGC CAGAAGTTCT CCTCTTCCTT 
-948 CCTACACCTG CTTCCTCCCT CCCTCCCTCC CTCTTTCCCT TCCTTCCTTC CATCCATTTG 
-888 TGAAGAAGAC ATGATCACCC TCATTCTGAG AGTGAAGAGA CAGAGGCTCA ACTAATGAAA 
-828 TGATTTGTTC AAGGTCACAC GGGTGGCACA AGGCAAGTGG CAGAGGTTGA ATTTAGACCC 
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-768 ATTCCTGTCC AAATGCTGAG TTTATGTCAT CGTCCCGAGA CCATAACTTT AAAGATGTAA 
-708 GATAGTGGGA AAAGAGTTGA TTTCAAAGCA CCTCTCAOAA GGACTCACTT TACATCAGGG 
-648 GTCAGCAGAC TCAGGCCAAA TCCGGTCCAT TCCCCGCTTT TGCAAAGAXA GTTGTAGTGG 
-588 AACACAGCTA GGCTTATTGA TTTATGGATT GCCAACGTCC TTTTGTGAAA CAGACAGCTG 
-528 AGCTGAGTAA TCGTGGCGCA CAAAACCTAA AATATTTACT ATCTCGTCCT TTACAGAATG 
-468 TTTGCCAATC TATGGTCCGG AGTCCAAGGC TGTCCATTTT TCAAAGAACA CAAAGTGACA 
-408 TGAGACTGTC CCATGTGCAG GGAGCCCTAT CATTTTATTA TGAAAAAACG GCCTTTCTGC 
-348 TCAAATCTGT TTTTTAAAAA GTCAACAAAC AGACTCTGGG TACCTGTCAG GAACAGTAGG 
-288 GAGTTTGGTT TCCATTGTGC TCTTCTTCCC AGGAACTCAA TGAAGGGGAA ATAGAAATCT 
-228 TAATTTTGGG GAAATTGCAC AGGGGAAAAA GGGGAGGGAA TCAGTTACAA CACTCCATTG 
-168 CGACACTTAG TGGGGTTGAA AGTGACAACA GCAAGGGTTT CTCTTTTTGG AAATGCGAGG 
-108 AGGGTATTTC CGCTTCTCGC AGTGGGGCAG GGTGGCAGAC GCCTAGCTTG GGTGAGTGAC 
-48 TATTTCTTTA TAAACCACAA CTCTGGGCCC GCAATGGCAG TCCACTGCTT GCTGCAGTCA 
13 CAGAATGGAA ATCTGCAGAG GCCTCCGCAG TCACCTAATC ACTCTCCTCC TCTTCCTGTT 
73 CCATTCAGAG ACGATCTGCC GACCCTCTGG GAGAAAATCC AGCAAGATGC AAGCCTTCAG 
133 GTAAGGCTAC CCCAAGGAGG AGAAGGTGAG GGTGGATCAG CTGGAGACTG GAAACATATC 
193 ACAGCTGCCA GGGCTGCCAG GCCAGAGGGC CTGAGAACTG GGTTTGGGCT GGAGAGGATG 
253 TCCATTATTC AAGAAAGAGG CTGTTACATG CATGGGCTTC AGGACTTGTG TTTCAAAATA 
313 TCCCAGATGT GGATAGTGCG ACCGGAGGGC TGTCTTACTT TCCCAGAGAC TCAGGAACCC 
373 AGTGAGTAAT AGATGCATGC CAAGGAGTGG GACTGCGATT CAGGCCTAGT TGAATGTGCT 
433 GACAGAGAAG CAGAGAGGGG CACCAGGGGC ACAGCCCGAA GGCCCAGACT GATATGGGCA 
493 AGGCCTGTCT GTGCTGACAT GTCGGAGGGT CCCACTCTCC AGGGACCTTG GTTTCCCCGT 
553 CTGTGACATC TGTGACATGA GAGTCACGAT AACTCCTTGT GTGCCTTACA GGGTTGTTGT 
613 GAAAATTAAA TGCACAGATA ATAGCGTAAC AGTATTCCGT GCATTGTAAA GAGCCTGAAA 
673 ACCATTATGA TTTGAAAATG GAATCGGCTT TGTGAGACCA TCACTATTGT AAAGATGTGA 
733 TGCTGATAGA AATGACAGGA CTGCTTGTGC ATGCCCTCTG CAGTGTGACA TTCCAGCAGT 
793 GAAATCATGT TGGGGTGACT TCTCCCCCAC TCTGACCTTT ATGTTTGTCT GGGCCOAGOC 
853 TGCAAGTCGG GCTCTGTGGG TGTATGAGTG ACAAGTCTCT CCCTTCCAGA TATGGGGACT 
913 GTCTGCTTCC CTAGGTTGCC TCTCCCTGCT CTGATCAGCT AGAAGCTCCA GGAGATCCTC 
973 CTGGAGGCCC CAGCAGGTGA TGTTTATCCC TCCAGACTGA GGCTAAATCT AGAAACTAGG 
1033 ATAATCACAA ACAGGCCAAT GCTGCCATAT GCAAAGCACT TTGGTTTGCC TGGCCACCCC 
1093 TCGTCGAGCA TGTGGGCTCT TCAGAGCACC TGATGAGGTG GGTACAGTTA GCCACACTTC 
1153 ACAGGTGAAG AGGTGAGGCA CAGGTCCCAG GTCAGGCTGG CCGGAGCTCT GTTTATTACG 
1213 TCTCACAGCT TTGAGTCCTG CTCTCAACCA GAGAGGCCCT TTACCAAGAA GAAAGGATTG 
1273 GGACCCAGAA TCAGGTCACT GGCTGAGGTA GAGAGGAAGC CGGGTTGTTC CCAAGGGTAG 
1333 CTGCTCCTGC AGGACTCTGA GCAGGTCACC AGCTAATGGA GGAAAGGCTC TAGGGAAAGA 
1393 CCCTTCTGGT CTCAGACTCA GAOCGAGTTA GCTGCAAGGT GTTCCGTCTC TTGAAACTTC 
1453 TACCTAGGTG CTATGGTAGC CACTAGTCTC AGGTGGCTAT TTAAATTTAT ACTTAAATGA 
1513 ATGAAAATAG AAGAAAATTT AAAATCCAGA CCCTTGGTCA CACTATCCAC ATTTAAAGAG 
1573 GTCAATAGCC ACATGTGGTT AGTGGCCACC CTATTGGGCA GTGCAGCTAC AGAACATTTT 
1633 TGCATCCCAG AAAGTTCTTT TGGATGTTGC TGCTCTACAG CATGCTTTGC TGAAACAGAA 
1693 GTGCCTTCCC TGGGAATCTC AGATGGGAAG CAAGTAAGGA GGGGAGTCAA ATGTGGGCTC 
1753 ACTGCTCACC AGCTGTGAGG GTTGGGCCTG CCTCTTAACC ATTGTCAGCC TCAGTCTTCT 
1813 CATCCATGCA TGCCGTGGGT ATACTAAAAT ACTATACCCC TGGAAGAGCT GGATGCAAAT 
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1873 TTGACAAGTT CTGGGGQACA CAOGAAGGTG CCAAflCACAA GGCTGGGCAC ATGGTGGCTG 
1933 TGCACTACAG CTQAGTCCTT WCCTTTTCA GAATCTGGGA TGTTAACCAG AAGACCTTCT 
1993 ATCTGAGGAA CAACCAACTA GTTGCTGGAT ACTTGCAAGG ACCAAATGTC AATTTAGAAG 
2053 GTGAGTGGTT GCCAGGAAAG CCAATGTATC TGGGCATCAC GTCACTTTGC CCGTCTGTCT 
2113 GCAGCAGCAT GGCCTGCCTG C1CAUCCCT AGGTGCAATG TCCTAATCCT TGTTGGGTCT 
2173 TTGTATTCAA GTTTGAAGCT GGGAGGGCCT GGCTACTGAA GGGCACATAT GAGGGTAGCC 
2233 TGAAGAGGGT GTGGAGAGGT AGAGTCTAGG TCAGAGGTCA GTGCCTATAG GCAAGTGGTC 
2293 CCAGGGCCAC AGCTGGGAAG GGCAAATACC AGAAGGCAAG GTTGACCATT CCCTTCCTCA 
2353 AGTGCCTATT AAGGCTCCAT GTTCCTATGT TGTTCAAACC CTAACTCAAT CCCAAATTAA 
2413 TCCACCATGT ATAAGGTTGA GCTATGTCTC TTATTCCTGG ACACCATACT CAGCCATATC 
2473 TGGTCCACAC ATTAACAGCT GGATGACCTT GAAGAAGCTT CACCCACTCT GTTCCTCAGC 
2533 TTTCCCTTCA GTGGGATGAT ATCAACTGGA CAACAGGATG TGCGATTCTT TTAGTTCCAG 
2593 CCTTCCAGGA TGTTTTCACT CCCCTGTTTG TTGTTGTAGG ATGGTATTAC CTCCACCTTC 
2653 CCACCTTCCC TATGCCCTGG TTCTGTCTCC TGTGCCTCGC TCTGAAAGTG GATGAGACCT 
2713 ACAATTCCTG TCCTGGTAGT TCTCCTAATG AACACACTGA AGCACGAGGA AGCTGAGATT 
2773 TTTGTTGCTA CATGAGAGCA TGGAGflCCTC TTAGGGAGAG AGGAGGTTCA GAGACTCCTA 
2833 GGCTCCTGGT GGAGCCCCAC TCATGGCCTT GTTCATTTTC CCTGCCCCTC AGCAACACTC 
2893 CTATTGACCT GGAGCACAGG TATCCTGGGG AAAGTOAGGG AAATATGGAC ATCACATGGA 
2953 ACAACATCCA GGAGACTCAG GCCTCTAGGA GTAACTGGGT AGTGTGCATC CTGGGGAAAG 
3013 TGAGGGAAAT ATGGACATCA CATGGAACAA CATCCAGGAG ACTCAGGCCT CTAGGAGTAA 
3073 CTGGGTAGTG TGCATCCTGG GGAAAGTGAG GGAAATATGG ACATCACATG GAACAACATC 
3133 CAGGAGACTC AGGCCTCTAG GAGTAAC7GG GTAGTGTGCA TCCTGGGGAA AGTOAGGGAA 
3193 ATATGGACAT CACATGGAAC AACATCCAQG A0AC7CAGGC CTCTAGGAGT AACTGGGTAG 
3253 TGTGCTTGGT TTAATCTTCT ATTTACCTGC AQACCAGGAA GATGAGACCT CTCTGCCCTT 
3313 CTGACCTCGG GATTTTAGTT TTGTGGGGAC CAGGGGAGAT AGAAAAATAC CCGGGGTCTC 
3373 TTCATTATTG CTGCTTCCTC TTCTATTAAC CTGACCCTCC CCTCTGTTCT TCCCCAGAAA 
3433 AGATAGATGT GGTACCCATT GAGCCTCATG CTCTGTTCTT GGGAATCCAT GGAGGOAAGA 
3493 TGTGCCTGTC CTGTGTCAAG TCTGGTGATG AGACCAGACT CCAGCTGGAG GTAAAAACAT 
3553 GCTTTGGATC TCAAATCACC CCAAAACCCA GTGGCTTGAA ACAACCAAAA TTTTTTCTTA 
3613 TGATTCTGTG GGTTGACCAG GATTAGCTGG GTAGTTCTGT TCCATGTGGT GGAACATGCT 
3673 GGGGTCACTT TGGAAGCTGC ATTCAGCAGA GTGCCAGGCT TGCGCTGGGC ATCCAAGGTG 
3733 GTCCCTCATC CTCCAGGCTC TCTTTCCATG TGATCTCTCA GTGTTTAAGA GTTAGTTGGA 
3793 GCTTCCTTAC AGCATGGCGG CTGACTTCCA AAAGGGATTA TTCCAAAAAG AGCCTCAACA 
3853 TGCAGGCGCT TATTATGACT TCTGCTTGCA TCATCCTATT GGCCAAAGCC AGTCACGTGG 
3913 CTAAGTCTAG CCCCCTGTGA GAGGAGACTG CATAAGAGTG TGAACACCAG GAGACACGG7 
3973 CACTGGGGGC CACCACTGTA ACCATCTACC ACAGGACCTG AATCTCTGTG TGCTACTCCC 
4033 TTGCTCAAGG GCCCCCCTAC CCACGCAGAC CTGCTGTCTT CTAGCAAAGC CCATCCTCAG 
4093 GACCTTTCTC TTCCAATCCT TATTGACTCA AATTGATTAG TTGGTGCTCC ACCCAGAGCC 
4153 CTGTGCTCCT TTATCTCATG TAATGTTAAT GGGTTTCCCA GCCCTGGGAA AACATGGCTT 
4213 TGTCTCAGGG GCTTGCTGGA TGCAACCTTA ACCTCAATGT GAGTGGCCAT ACTGTGOCAC 
4273 TGTCCCATCC CTCACCAGGG ACACTGTTCT GGAGGGTGAC TGCCTGTTCT GTQAGGAGTG 
4333 GGGATGGCTA GGACATTGCA TGGAACACAC CACCACCCCA TCTTCTCAGA GCTCAAACCC 
4393 TGACAGAACA CCAGCTCCAC AGGCCTTGGC TTCTGCTGAT GGTGCCGTGT ATTTACCAGA 
4453 CTTAGTGGTC CAAGGCCAGA GTGGCAGATT TCCCAAAGTC AAGGTGTGAC AGTGGGACAG 
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4513 CCTCTTTGTG TCTTTOCTGT CCTAAQAAAC CTGGGCCAGG CCAGGCGCAG TGGCTCACGC 
4573 CTTGTAATCC CAGCACTT7G AOAGGCCAAG GTGGGCAGAT CACGAGGTCA GGAGTTTGAG 
4633 ACCAGCCTGG CCAACATTGG TQAAACCCfG TCTCTATTAA AAATAGAAAA CATTAGACAG 
4693 GTGTGGTGGT GCATGCCTGT AATCCCAGCT ACTCAGGAGG CTGAGGCAGG AGAATCGCTT 
4753 GAACCCAGGA GGTGGAGGTT GCAGTOAGCC GAGATTGTGC CACTGCACTC CAGCCTAGGC 
4813 GACAGAGCAA GACTCCGTC7 CGGGAAAA77 AA7TAATAAA TAAATAAACC TAGGTCCCAG 
4873 AGTCCCACAG AATGGCAGAC AGGAGCACCT GGGGGCTTTT AGGGTATGGC ATTTCCCCTG 
4933 TACTAACTCT GGGCTGTCCA GAGGCGA7T7 CATGGCGTGG AGTGGAGAGG GAGGCAGCAC 
4993 AGGACTTCCT AGGCCTCAGC TCTCACCTGC CCATCTTTTG ATTTCCAGGC AGTTAACATC 
5053 ACTGACCTGA GCGAGAACAG AAAGCAGGAC AAGCGCTTCG CCTTCATCCG CTCAGACAGT 
5113 GGCCCCACCA CCAGTTTTGA GTCTGCCGCC TGCCCCGGTT GGTTCCTCTG CACAGCGATG 
5173 GAAGCTGACC AGCCCGTCAG CCTCACCM? ATGCCTGACG AAGGCGTCAT GGTCACCAAA 
5233 TTCTACTTCC AGGAGGACGA GTAGTAC7GC CCAGGCCTGC CTGTTCCCAT TCTTGCATGG 
5293 CAAGGACTGC AGGGACTGCC 107CCCCCTO CCCCAflflGCT CCCGGCTATG GGGGCACTGA 
5353 GGACCAGCCA TTGAGGGGTG GACCCTCAGA AGGCGTCACA ACAACCTGGT CACAGGACTC 
5413 TGCCTCCTCT TCAACTGACC AGCCTCCATG CTGCCTCCAG AATGGTCTTT CTAATGTGTG 
5473 AATCAGAGCA CAGCAGCCCC TGCACAAAGC CCTTCCATGT CGCCTCTGCA TTCAGGATCA 
5533 AACCCCGACC ACCTGCCCAA CCTOCTCTCC 7CTTGCCACT GCCTCTTCCT CCCTCATTCC 
5593 ACCTTCCCAT GCCCTGGATC CATCAQGCCA CTTGATGACC CCCAACCAAG TGGCTCCCAC 
5653 ACCCTGTTTT ACAAAAAAGA AAAGACCAGT CCATGAGGGA GGTTTTTAAG GGTTTGTGGA 
5713 AAATGAAAAT TAGGATTTCA TOATTTTTTT TTTTCAGTCC CCGTGAAGGA GAGCCCTTCA 
5773 TTTGGAGATT ATGTTCTTTC OfflVUftfrflGC TGAGGACTTA AAATATTCCT GCATTTGTGA 
5833 AATGATGGTG AAAGTAAGTG GTAGCTTTTC CCTTCTTTTT CTTCTTTTTT TGTGATGTCC 
5893 CAACTTGTAA AAATTAAAAG TTATGGTACT ATGTTAGCCC CATAATTTTT TTTTTCCTTT 
5953 TAAAACACTT CCATAATCTG GACTCCTCTG TCCAGGCACT GCTGCCCAGC CTCCAAGCTC 
6013 CATCTCCACT CCAGATTTTT TACAGCTGCC TGCAGTACTT TACCTCCTAT CAGAAGTTTC 
6073 TCAGCTCCCA AGGCTCTGAG CAAATGTGGC TCCTGGGGGT TCTTTCTTCC TCTGCTGAAG 
6133 GAATAAATTG CTCCTTGACA TTGTAGAGCT TCTGGCACTT GGAGACTTGT ATGAAAGATG 
6193 GCTGTGCCTC TGCCTGTCTC CCCACCAGGC TGGGAGCTCT GCAGAGCAGG AAACATGACT 
6253 CGTATATGTC TCAGGTCCCT GCAGGGCCAA GCACCTAGCC TCGCTCTTGG CAGGTACTCA 
6313 GCGAATGAAT GCTGTATATG TTGGGTGCAA AGTTCCCTAC TTCCTGTGAC TTCAGCTCTG 
6373 TTTTACAATA AAATCTTGAA AATGCCTATA TTGTTGACTA TGTCCTTGGC CTTGACAGGC 
6433 TTTGGGTATA GAGTGCTGAO GAAACTGAAA GACCAATGTG TYTTYCTTAC CCCAGAGGCT 
6493 GGCGCCTGGC CTCTTCTCTO AGAGTTCTTT TCTTCCTTCA GCCTCACTCT CCCTGGATAA 
6553 CATGAGAGCA AATCTCTCTG CGGGG 
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